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Introduction 


About this book 


This book has been written to cover the Cambridge AS & A Level 
International Mathematics (9709) course, and is fully aligned to the 
syllabus. 


In addition to the main curriculum content, you will find: 


e ‘Maths in real-life, showing how principles learned in this course are 
used in the real world. 


e Chapter openers, which outline how each topic in the Cambridge 
9709 syllabus is used in real life. 


The book contains the following features: 


Did you know? 


Advice on 


EXAM-STYLE QUESTION 
calculator use 


Throughout the book, you will encounter worked examples and a host 

of rigorous exercises. The examples show you the important techniques 
required to tackle questions. The exercises are carefully graded, starting 
from a basic level and going up to exam standard, allowing you plenty of 
opportunities to practise your skills. Together, the examples and exercises 
put maths in a real-world context, with a truly international focus. 


At the start of each chapter, you will see a list of objectives that are covered in 
the chapter. These objectives are drawn from the Cambridge AS & A Level 
syllabus. Each chapter begins with a Before you start section and finishes 
with a Summary exercise and Chapter summary, ensuring that you fully 
understand each topic. 


Each chapter contains key mathematical terms to improve understanding, 
highlighted in colour, with full definitions provided in the Glossary of 
terms at the end of the book. 


‘The answers given at the back of the book are concise. However, you 
should show as many steps in your working as possible. All exam-style 
questions, as well as Paper A and Paper B, have been written by the author. 


About the authors 


James Nicholson is an experienced teacher of mathematics at secondary 
level; he has taught for 12 years at Harrow School and spent 13 years as 
Head of Mathematics in a large Belfast grammar school. He is the author 
of two A Level statistics texts, and editor of the Concise Oxford Dictionary 
of Mathematics. He has also contributed to a number of other sets of 
curriculum and assessment materials, is an experienced examiner and has 
acted as a consultant for UK government agencies on accreditation of new 
specifications. 

James ran school workshops for the Royal Statistical Society for many 
years and has been a member of the Schools and Further Education 
Committee of the Institute of Mathematics and its Applications since 2000, 
including six years as chair; he is currently a member of the Community of 
Interest group for the Advisory Committee on Mathematics Education. He 
has served as a vice-president of the International Association for Statistics 
Education for four years, and is currently Chair of the Advisory Board to 
the International Statistical Literacy Project. 


A note from the author 


The aim of this book is to help students prepare for the Statistics 1 unit of the 
Cambridge International AS & A Level Mathematics syllabus, although it 
may also be useful in providing support material for other AS and A Level 
courses. The book contains a large number of practice questions, many 

of which are exam style, in addition to questions from past Cambridge 
examinations papers. 

In writing the book I have drawn on my experiences of teaching 
Mathematics, Statistics and Further Mathematics to A Level over many 
years, as well as on my experience as an examiner and discussions with 
statistics educators from many countries at international conferences. 
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Syllabus: Cambridge International AS & A Level 
Mathematics: Probability and Statistics 1 (9709) 


PROBABILITY & STATISTICS 1 Student Book 


Syllabus overview for 9709, first examined in 2020. 

Probability & Statistics 1 (Paper 5) 

1. Representation of data 

* Select a suitable way of presenting raw statistical data, and discuss advantages and/or Pages 34-59 


disadvantages that particular representations may have 
¢ Draw and interpret stem-and-leaf diagrams, box-and-whisker plots, histograms and Pages 34-59 
cumulative frequency graphs 
¢ Understand and use different measures of central tendency (mean, median, mode) and Pages 14-25 
variation (range, interquartile range, standard deviation), e.g. in comparing and contrasting 
sets of data 
e Use a cumulative frequency graph Pages 45-48 
© Calculate the mean and standard deviation of a set of data (including grouped data) either Pages 26-29 
from the data itself or from given totals such as ©x and Yx?, or coded totals ¥ (x - a) and 
(x - a)’, and use such totals in solving problems which may involve up to two data sets. 


2. Permutations and combinations 


¢ Understand the terms permutation and combination, and solve simple problems involving Pages 98-111 
selections 
© Solve problems about arrangements of objects in a line, including those involving Pages 98-111 


— repetition (e.g. the number of ways of arranging the letters of the word ‘NEEDLESS’) 
— restriction (e.g. the number of ways several people can stand in a line if 2 particular 
people must — or must not — stand next to each other) 


3. Probability 


e Evaluate probabilities in simple cases by means of enumeration of equiprobable elementary Pages 63-67 
events, or by calculation using permutations or combinations 

¢ Use addition and multiplication of probabilities, as appropriate, in simple cases Pages 67-71 

e Understand the meaning of exclusive and independent events, including determination Pages 74-82 
of whether events A and B are independent by comparing the values of P(A nm B) and 
P(A) x PB) 


* calculate and use conditional probabilities in simple cases. Pages 71-74 
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4. Discrete random variables 


Construct a probability distribution table relating to a given situation involving a discrete 
random variable variable X, and calculate E(X) and Var(X) 

Use formulae for probabilities for the binomial and geometric distributions, and recog- 
nise practical situations where these distributions are suitable models 


Use formulae for the expectation and variance of the binomial distribution and for the 
expectation of the geometric distribution 


Pages 84-96 


Pages 115-120; 
122-131; 133-136; 
139-145 
Pages 120-122; 
137-139 


5. The normal distribution 


Understand the use of a normal distribution to model a continuous random variable, and 
use normal distribution tables 


Solve problems concerning a variable X, where X ~ N(y, 0°), including: 

— finding the value of P(X > x,), or a related probability, given the values of x,, 1, o, 

— finding a relationship between x,, 1 and o given the value of P(X > x,) or a related 
probability 

Recall conditions under which the normal distribution can be used as an approximation 

to the binomial distribution (n large enough to ensure that np > 5 and ng > 5), and use 

this approximation, with a continuity correction, in solving problems 


Pages 147-171 


Pages 147-171 


Pages 173-179 


Introduction to statistical thinking 


Statistical data have been recorded 

for most of the time civilisation has 
existed. The Romans gathered a lot of 
information on the population size 
and wealth of nations in their empire. 
They usually carried out a census every 
five years, which required citizens 

to register their duties and property; 
this was then used to calculate taxes. 
Modern statistics, which attempts to 
do more than just record information, 
dates back to the seventeenth century. 


Objectives 


e Describe the role of statistics in modern society. 


e Identify types of data: primary and secondary, categorical and numerical, discrete and 
continuous. 


1.1 What is statistics and why is it important? 


We live most of our lives relying on only partial information to make decisions. 
Some of the information we use is numerical, other information can be 
summarised numerically. 


As we don't often have access to all the information when making decisions, 
statistics is largely about trying to make the best use of the data we have at 


any given time. If we have a better understanding of the information we Body temperature measurements 
have available, this generally helps us to make better decisions because 104 
i: ted c 4 
we have a better understanding of the risks involved. = 
2 102 -——+ 
Consider the graph on the right, which shows the body E fee | 
temperature readings for 20 people. Each person has had — 100 To 
& > 
their temperature taken with a mouth (‘oral’) and an ear © opl-Saet. 
= 
(‘tympanic’) thermometer. E 
, ™ 96 
Notice that for each person there are variations between the oral 96 98 100 102 104 
and tympanic readings, even though they claim to be testing the Oral temperature (°F) 


same value (body temperature). 
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The next graph shows the temperatures of the same 20 people but Body temperature measurements 


now the gender of each person is identified. We see that there iseven = = 1 7 
more variability: it appears that males and females may have different © 102 | 4 
body temperature characteristics. = . 
2 100 
We are surrounded by variation — essentially the unpredictability of life. 5 
Statistics is a tool for helping us to make sense of the partial information §§ 98 wae 
8 > 
we have available and make more informed decisions. We already do & = Female 
‘ aes : ae: 96 
this naturally and informally in many everyday situations. ae. Oe. nd aloe 
Consider the following examples involving uncertainty: Gral temperature (°F) 


Example 1 


The insurance industry is based on a business contract that is beneficial to both 
the individual and the company. Why do we have insurance? 


It is good for us: a big loss may be unlikely to happen, but we couldn’ afford it if it did. 


It is good for the insurance companies: the large number of premiums they take in pay 
for the small number of claims made, which means they will make a profit. 


Although insurance involves uncertainty for both parties, statistics can be used to calculate 
the potential risk and rewards involved. 


Example 2 


An elevator should have a safety notice giving the number 
of people and the weight it is able to accommodate safely. 
How should the company who makes the elevator calculate 
these? 


The number of people allowed in the elevator at one time is only a proxy for the important 
variable — the total weight. The company who makes the elevator will have to assume an average 
weight for each person; however, their weight could vary dramatically. The reality is that all 
mathematics of combining distributions depends on assumptions of independence, and we 
must keep the presence of variation in mind. 


What is statistics and why is it important? 


Did you know? 

This idea of a proxy is key in medical tests, for example, where measuring something 
directly is very expensive or requires invasive surgery. If something else is found to be 
closely associated and is easier to measure, then that is often measured instead. 


Exercise 1.1 


Consider the following situations, where there may be a considerable 
degree of variation. 


1. Why is a knockout competition much more likely to produce a 
‘surprise’ winner than an extended league format? 


Knockout competition League | Player A | Player B | Player C | Player D 


((Piayera } (Players) {Player | (_PlayerD ) Player A 


Player B 
SS = 

Player D 
The knockout competition may 


produce a winner, but does this mean 
that they are statistically the best player? 


2. A haulage firm has to quote a price for a job in which 
one of the major costs is the length of time the journey 
will take. They have to quote in advance, sometimes quite 
a long time in advance, but don’t know how long the 
journey will take until it happens. What advice would 
you give to them? 


3. The government are to set up an enquiry into health and 
safety issues relating to the positioning and operation of 
mobile phone transmitters. What factors should they 
be looking at? 
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Apart from the presence of variation, we need to be conscious 
that different contexts can tolerate different amounts of variation. 


For example: 


A newspaper does not need the same precision in 


printing as a banknote. 


Exercise 1.2 


We know from experience that when a die is 
thrown repeatedly, we will see different scores, 

but we usually don’t think about this systematically. 
It is often difficult to judge how the scores of a die 
will vary, as we will see in the following exercise. 


Carry out the following experiments, and then 
consider the questions: 


A Tossa fair coin 10 times and count the number 
of heads that are seen. Repeat this at least 5 times. 


B_ Throw a fair die 20 times and record the results 
using a tally chart. Make a table showing the 
frequencies of each score: 1, 2, 3, 4, 5 and 6. 

Repeat this so you have at least 3 sets of frequencies. 


C_ Tossa fair coin and count the number of tosses until 
a tail is seen. Record ‘1’ if a tail is seen on the first 
toss, record ‘2’ if a tail is seen on the second toss, 
and so on. Repeat this at least 10 times. 


What is statistics and why is it important? 


‘There can be more variation in the 
manufacture of generic travel sickness 
tablets than of powerful chemotherapy 
drugs. 


—,° 


‘Tragic consequences may follow ifa 
change in the quality of steel girders 
produced for use in bridges or large 

buildings is not immediately picked 

up and rectified. 


1. When a fair coin is tossed 10 times, we will on average see five heads. 
Did you get five heads most of the time for Experiment A? 
What proportion of the time would you expect to get five heads? 


2. When a fair die is thrown 20 times, we will not see all the possible 
outcomes an equal number of times. We will see each score ‘on average’ 
between three and four times. When you did Experiment B, did all the 
possible scores occur three or four times? Were there any scores which 


did not come up at all in a complete group of 20 throws? 


Ifa die is rolled 20 times, what proportion of the time should we expect 
to produce a zero frequency for a particular score - i.e. for a particular 
score to not be rolled? 


3. When a fair coin is tossed, we will see a tail on the first toss about half the 
time, so it is probable that a number of 1s were seen in Experiment C. 


Consider the largest number you have in your list. If this experiment 
was repeated many times, how often should we expect to see a 4 
appear in the list? Is it possible that 100 would appear? 


Sampling 


If you go on to study $2 or work with statistics later, you will encounter sampling 
theory. You have probably considered simple aspects of this already in your study 
of statistics - the need to avoid bias, for example. So why do we use samples? 


e To save time and money. Often sampling will give us a large proportion 
of the total information for a fraction of the cost ~ certainly a law of 
diminishing returns applies to increasing the sample size. 


e It is often not possible to have complete information about a population, since 
© some tests involve the destruction of the item 
© testing may be very expensive 
0. there just may not be time. 


The purpose for which the information is to be used influences how accurate we feel 
the information needs to be. Compare, for example, information which is needed 


e to assist marketing strategy 


e to monitor health and safety. 


Consider the following situations: 
e Anairline has 243 passengers booked on the last flight of the day, from Sydney to Seoul. 


© Ifthe airline has to cancel the flight, how many of those passengers 
will want to travel to Seoul the next day? 


o How many people will the airline have to provide overnight accommodation for? 
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It is not realistic to expect to know in advance exactly how 

many of the 243 passengers will want to travel the next day, 
or how many will want overnight accommodation. However, 
if the airline has some historical data on what has happened 
in similar circumstances on previous occasions, it can start 

making contingency plans. Many airlines now ask whether 

the passenger is travelling for business, to visit family or for 
leisure when they book tickets. 


Probability distributions will help you understand what 

happens in these situations. 

e A rare but very serious disease occurs in 1 in 10000 people. There is a 
screening test which gives a positive result in 99% of cases where 
the subject has the disease and gives a negative result in 99% of 
cases where the subject does not have the disease. How reasonable is it 
to tell a patient whose test comes back positive that they have this 
serious disease? 


Even though this screening test is extremely accurate, both for those with 
the disease and those without, for every person whose positive result 
occurred because they had the disease, there will be around 100 people 
who gave a positive result without having the disease - because the disease 
is so rare. 


Conditional probability will help you understand what the implications 
of a positive result in a situation like this actually are. 


e ‘There is substantial evidence that shows that students in 
large classes in the UK have better examination results than 
students in small classes. Would it be reasonable for the UK 
government to decide that all students should be taught in 
large classes, in an effort to improve educational standards? 


This probably would not be a good idea — schools put students 
who are more capable academically into larger classes, and 
their academic ability is the reason they get better results. 


This is an example where simply observing what is going on, instead of 
conducting a designed statistical experiment, could be very misleading. 


1.2 Types of data 


Data can either be quantitative (numerical) or qualitative (non-numerical). 


Categorical data are data which can be divided into distinct categories, and 
can be either qualitative or quantitative. 


Types of data 


The fruit on this trader's stall can be described by its properties. 


For example: 


the wumber of that 


The type of fruit type of fruit 

The colour the weight of a 

How ripe it is plece of fruit 

These are qualitative The cost of the fruit 
data - non-numerical, ‘These are quantitative 
categorical data data - numerical 


here are two types of quantitative data. 


Quantitative data can be either discrete (limited to particular values) or 
continuous (any value within a range). 


he number of pieces of fruit can only be one of a list of 
values (0, 1, 2, ...). For example, these 5 oranges are an 
example of discrete data. 


‘The weight of the oranges can be read from the scales as 
400 grams. If you measured it more exactly, however, 
you might find it is 1396.2 g (1 d.p.). 

his is an example of continuous data - it will always 
be reported as a rounded value. 


Example 3 
What data about the radio are given in the advertisement? 


For each piece of data, state whether it is qualitative or 
quantitative, and, if quantitative, whether it is discrete 
or continuous. 


25 em = 20 em = 5 cm 
Colour Available in black or silver 
Mains/battery Mains, or 4 « AA batteries 


> Continued on the next page 
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Power: quantitative, continuous 

Wavebands: qualitative 

Number of batteries needed: quantitative, discrete 
Price: quantitative, discrete 


Colour: qualitative 
Length, width, height: quantitative, continuous 


For a large number of data observations, a list is not easy to make sense of. 


A frequency table of values can be used to summarise a large list. 
Sometimes it is helpful to use groups of values so the data are more 
easily understood. 


Example 4 

A group of 45 women go on a skiing 
holiday. 

Summarise the following data in frequency 
tables. 


a) The number of children each woman has: 
Wy de 0 8 al 
Pe a ha al ee i iy Si 
2s) 0) 0 2 ae 3 01s 0: 


b) The women’ ages (use class intervals of 15-19, 20-24, ..., 45-49): 
22) 26° 26) 4526. 27) 17) §29) 435") 38) 925.923) 17" 38) 48 
Prey AGN aay ee) IE) Wei sis 2s Aha Pie Tae Pf AR PIs} 
28) (23) S35) 24, 295037 20) 30) 44" 250 19) 32) 22-22) Se 


> Continued on the next page 
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a) b) 


Number of Age Frequency 
‘ Frequency 
children 15-19 6 
0 12 20-24 7 
1 21 25-29 15 


Example 5 

Consider the marina at Monte Carlo; 

list at least four pieces of relevant data that 
might be collected. Include the type of data, 
and how it may be collected. 


Examples of data that could be collected for this situation include the number of boats in the 
marina at any time (quantitative, discrete) and, for each boat 
e the colour (qualitative) 

the length (quantitative, continuous) 

the age of the boat in years (quantitative, discrete) 

the name (qualitative) 

the value (quantitative, discrete - though very many values are possible). 
The number of boats and colour could simply be observed. The length could be measured, though 
perhaps only approximately if you can’t get permission to board the boat. The age of the boat, 
however, might need sight of a register not freely available and the value can probably only be 
estimated by making comparisons with other similar boats sold in the recent past - and Monte 
Carlo marina probably has some boats in it for which there are very few comparable boats, 
making the estimate difficult. Colour, name and length are potential primary data examples, while 
the year of commission and the value are likely to be available as secondary data only. 
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Primary data is any data which you collect yourself — it may be measurements but 
can also be through questionnaires or other surveys, investigations and experiments. 


Secondary data has been collected by someone other than the person using the 
data —- which may be in the form of the raw data, but it also includes databases, 
published statistics, newspapers etc. 


The internet has a huge amount of secondary data freely available for 
people to access, for example National Statistics, but also data from 
organisations such as the Guinness Book of Records are available online. 
With secondary data it is important to know that it was collected reliably, 
and that the data matches what you want to investigate — if it does not 
have all the information you need, you will not be able to use it. 
However, when it is available, secondary data saves a lot of time and 
effort. A lot of data is now collected automatically (data logging) 

using some form of technology. 


Example 6 


Amir is concerned about the traffic passing his house. One morning, he 
spends an hour counting the cars that pass his house. He then hears that 
the council have installed a sensor which automatically registers when a 
car goes past. Amir gets a summary of the traffic in the past six months 
from the council. 


‘The first set of data described is primary because Amir collected it himself. 
The second set is secondary data because it was collected by someone else 
(the council) in a data logging process. 


Exercise 1.3 
1. The number of days each student in a class was late during one 
week was recorded. Summarise these data in a frequency table. 
0 il 0 0 1 3 0 0 0 0 
0 5 1 0 1 0 0 0 0 2 
0 1 1 0 0 0 1 2 0 0 
2. The depth of water (in feet) at high tide was recorded at 
San Francisco, Golden Gate, for a period of 15 days in January 2012. 


Summarise the data, listed here, in a grouped frequency table. 
Use appropriate class intervals. 


55 3.6 56 36 57 39 58 41 60 43 


62 45 63 47 64 49 64 50 63 5.2 
61 54 57 56 52 58 46 459 41 


Types of data 


For each of the following situations, 
list at least four pieces of relevant data 
that may be collected. Include at least 
one type of primary data and one type 
of secondary data, stating what type it 
is and how it may be collected. 


Aclass can work on this part of the exercise in groups, each 
group looking at a given number of questions. If working on 
your own, try the first three or four questions and then compare 
your answers with the answer section — the suggestions listed 
there are unlikely to be the same as yours, but you can make 
sure your list contains the same sorts of things. 


a) Some cars for sale on a showroom forecourt. b) A motorcycle road race. 


d) A river in New Zealand. 
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g) The snow that fell in a snowstorm. h) A baseball player. 


Chapter summary 

e Data can either be quantitative or qualitative. 

e Quantitative data is numerical in kind. 

e@ Qualitative data is non-numerical in kind. 

e@ Quantitative data can either be discrete (limited to particular values) or continuous 
(any value within a range). 

e A frequency table of values can be used to summarise a large number of data observations. 
Groups of values may be used so the data are more easily understood. 


Types of data 


2 Measures of location and spread 


sl Frank Anscombe (1918-2001) was a British 
statistician who illustrated brilliantly why we should 
be careful not to rely too heavily on summary 
~ statistics. The four data sets shown here, despite being 
very different, have identical summary statistics - the 
mean and variance of x and y are the same, and they 
have the same line of best fit and correlation. Extreme 
values can get lost in summary statistics but often have 
a huge impact - a freak storm for instance. 


Objectives 

e Understand and use different measures of central tendency (mean, median and mode) and 
variation (range, interquartile range and standard deviation), for example when comparing 
and contrasting sets of data. 


e@ Calculate the mean and standard deviation of a set of data (including grouped data), either 
from the set of data itself or from given totals such as Me and ea or Ye —a)and Ye -a). 


Before you start 


You should know how to: Skills check: 
1. Calculate the mean, median, range and mode 1. For the data 3, 5, 2, 0, 2, 1, 3, 2, 0, 
of a simple set of data, calculate 


e.g. 7, 5, 3, 4, 7,5; 7, 5, 9, 6, 5. 


ae ‘ . a) the mean 
The median is the middle value with the data 


in order: 3, 4, 5, 5, 5,5, 6, 7, 7, 7, 9. Here the b) the median 

median is 5. c) the range 

Range = highest value — lowest value: d) the mode. 
Range:s?'=.3 =6; 2. For the data 7, 8, 6, 6, 6, 8 


The mode is the value which appears most 


a calculate 
often: Here the mode is 5. 


a) the mean 


Mean = sum of values 
~ numberof values b) the median 
= 2+54+34+44+74+54+74+549+6+5 _ 63 c) the range 
11 11 
=5.7 (ldp.) d) the mode. 
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2.1 Averages 


Remember 
e the mean is the sum of all the values divided by the number of values 


e the median is the middle value when the values are arranged in order 


e the mode is the most commonly occurring value. 


Example 1 


A group of students were given a short mental arithmetic test. Their scores were 
7, 6, 7, 5, 6, 8, 5,7, 8, 9. 


For this group, calculate 
a) the mean b) the median c) the mode. 


deeeeeeeeeecerassccecscessessesseseesensssesssessenenssssseesnensenssesesssessassesessssssessssesssssassseneed 


a) The sum of the ten scores is 68, so the mean is s = 6.8. 


b) In order, the values are: 5, 5, 6, 6, 7, 7, 7, 8, 8,9. 
The median is the middle value. Here this is halfway between the 5th and 6th scores, 
which are both 7, so the median is 7. 

c) The score 7 occurs more than any other, so the mode is 7. 


Data in a frequency table 


‘The frequency table on the right shows the number of children that 
45 women have. 


Number of | Frequency 


[o find the mode, look for the greatest frequency. Here, 
he category for one child has the greatest frequency, 21. 
Therefore, the mode is one child. 


To find the median, look for the middle value. Here there are 
45 values, so the median is the 23rd value, with the values 
listed in order. 


Make a running total of the frequencies: 


e there are 12 women with no children 


e there are 33 women with no children or one child. 


The 23rd value, or median, is therefore one child. 


Averages 


To calculate the mean, construct a table of values with a further column 
showing value x frequency: 


Number of | Frequency xf 
children f 
x 
0 12 0 
1 21 21 
3 3 9 
4 0 0 
5 1 | 5 
Yre4s | Vxf=5i 


he mean number of children is 3 =1.1(1d.p.). 


4 


o 


ata in a grouped frequency table 


‘The ages of the same group of 45 women are shown in the grouped frequency table. 
Age (years) | Frequency 
15-19 6 
20-24 7 
25-29 15 
30-34 6 
35-39 8 
40-44 1 
45-49 Ze 


The modal class is 25-29 years, as this is the class with the greatest frequency. 
The median is the 23rd value, with the values listed in order. 


Make a running total of the frequencies: 

e there are 6 women less than 20 years old 

e there are 6 + 7 = 13 women less than 25 years old 

e there are 13 + 15 = 28 women less than 30 years old. 


The median age therefore lies in the 25-29 age group. 
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To calculate the mean you need to add up all the values, but because the 
frequencies are grouped, you need to use the value at the mid-point of the 
interval to make an estimate: 
Age Frequency | Mid-interval mf 
(years) f value 
m 

15-19 6 17.5 105 

20-24 7 22.5 157.5 

25-29 15 27.5 412.5 

30-34 6 32.5 195 

35-39 8 37.5 300 

40-44 1 42.5 42.5 

45-49 2 47.5 95 

dps © mf = 1307.5 


The estimated mean age of this group of women is therefore 
Bes 5 = 29.1 (1 d.p.) - that is, about 29 years old. 


Exercise 2.1 


1. A golfer keeps a record of his scores in club competitions during one year. 
They are 


75, 77, 77, 74, 79, 76, 84, 76, 75, 77 
Calculate 
a) the modal score 
b) the median score 


c) the mean score. 


2. A vet keeps a record of the number of kittens in litters Number in | Frequency 
produced by cats in his practice. litter 
Calculate ib 2 
a) the modal size of a litter 2 4 
b) the median size ofa litter 3 7 
c) the mean size of a litter. 4 in 
E} 8 
6 4 
7 2 
8 dl. 


item Averages 


3. A large group of teenagers took part in an exercise session. Their pulse rates 


were measured before the session began and then measured again after a 
series of warm-up exercises. The two sets of data are given below. 
For each condition, calculate an estimate of the mean pulse rate for the group. 
Use your results to compare the distributions. 
Pulse rate before warm-up | Frequency Pulse rate after warm-up | Frequency 
(beats per minute) (beats per minute) 
60-69 5 80-89 9 
70-79 8 90-99 12 
80-89 22 100-109 25 
90-99 29 110-119 17 
100-109 13 120-129 9 
110-119 5 130-139 Z 
120-129 0 140-149 3 


4. The table shows information about the height of plants of a particular species. 


Height (cm) | 20-50 | 50-60 | 60-65 | 65-70 | 70-80 | 80-90 | 90-110 

Numberet | on | ag | te | oe | oe | | oe . 

plants | 
Calculate an estimate of the mean height of these plants. »% 


5. Students recorded the length of time they spent Time (minutes, correct to | Frequency 
travelling to school on a particular day. the nearest minute) 
A summary of the results is shown on the right. 1-15 115 
Calculate an estimate of the mean time it took 16-25 | 46 
students to travel to school that day. 26-35 | 36 
36-55 22 
6. ‘The table shows information about the salaries 5 
paid to employees in a company. S80 i“ 
Salary Frequency 
$0 <x < $10000 7 
$10000 <x < $15000 82 
$15000 < x < $20000 45 
$20000 < x < $25000 24 
$25.000 < x < $30000 13 
$30000 < x < $50000 4 


Calculate an estimate of the mean salary of employees in the company. 
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2.2 Quartiles and the interquartile range 


A useful way of organising data is to divide them into quarters, which are known as quartiles. 


These dots represent 19 values: 


Q, is the lower quartile. 


For a data set with m values Gs Digi x); calculate in. 


If tn is an integer r then Q, is the mid-point of x, and x,,,. 


fin lies between rand r+ 1 then Q is x,,,.. 


ee ©00 e8 e000 08 cee ceo 
1 1. 


0 10 20 30 40 


7 x 19 = 4.75, so Q, is the Sth value. 


Q, is the median. 
To find the median, calculate ; n. 


If $ nis an integer r, then Q, is the mid-point of x and x... 


ifSn lies between rand r + 1, then Q, is x,,,. 


% 
ee e080 ee e800 #8 e000 eee 
es | 
10 20 30 40 


Nie eo 


x 19 = 9.5, so Q, is the 10th value. 


Q, is the upper quartile. 
To find the upper quartile, calculate fn. 


If Sn is an integer r, then Q, is the mid-point of x, and x,,,. 


If on lies between r and r+1, then Q, is x,,. 


% 
ee e080 ee eee e8 e200 coo 


0 10 20 30 40 


Quartiles and the interquartile range 


2 X 19 = 14.25, so Q, is the 15th value. 


The range = maximum — minimum =(Q, - Q,) 
The range indicates the spread of the values. 


% 4 
Ce e080 e8 e000 e8 e008 880 
1 1 A 


t 4 
0 10 20 30 40 
« > 


The interquartile range (IQR) = Q, - Q, 
The IQR indicates the spread of the middle 50% of values. 
Q, Q, 
ee ee0 ee see00e e¢ ecoVe eee 
t . : A 1 


0 10 20 30 40 
rs 


The semi-interquartile range is simply half of the IQR, and is sometimes 
used as an alternative measure of spread. 


Example 2 

‘The scores of a group of students in a short mental arithmetic test (from Example 1) were, in order, 
931050510575) 7517.5, 8555 9= 

Calculate 

a) th 


a) n=10,so in = 2.5 and Q, is the 3rd value in the ordered list. Q, = 6. 
$n = 5,80 Q, is the mid-point of the 5th and 6th values in the ordered list. 


These are both 7, so the median is Q=7. 
3n = 7.5, so Q, is the 8th value in the ordered list. Q, = 8. 


b) IQR=8-6=2. 


Exercise 2.2 
1. A golfer keeps a record of his scores in club competitions 
during one year. They are 
75, 77, 77, 74, 79, 76, 84, 76, 75, 77 
Calculate the lower quartile and the upper quartile for these data. 
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2. A vet keeps a record of the number of kittens in litters Number in | Frequency 
produced by cats in his practice. litter 


Calculate the lower quartile and the upper quartile for these data, 1 


shown in the table on the right. 


3. A group of students recorded their pulse rate when they 
were in a relaxed, restful state. 


53 54 55 55 57 58 59 59 59 62 62 62 
62 62 63 63 63 64 64 65 65 67 67 67 
68 68 69 69 70 71 71 73 75 79 80 83 


e 
Bly} plole ||| 


2 
3 
4 
Their results (in beats per minute), were: 5 
6 
7 
8 


Calculate 
a) the median 
b) the interquartile range of these pulse rates. 
4. The ages of a group of 45 women are: 
17 17 17 19 19 19 20 22 22 22 23 23 24 25 25 
25 26 26 26 26 27 27 28 28 28 29 29 29 30 32 
32 32 34 34 35 35 35 35 36 37 38 38 44 45 48 
Calculate 
a) the median 
b) the interquartile range of their ages. 
5. Ata metro station, a regular passenger records how long (in seconds) he has 
to wait for a train to arrive once he gets to the platform. His results are: 
87 42 0 62 124 0 58 37 74 94 
182 23 17 62 29 17 82 54 0 45 
Find 
a) the median 
b) the interquartile range of these times. 
6. ‘The length of time, in minutes, that customers spend in a coffee shop is recorded. 
‘The results are: 
17, 15, 9, 31, 33, 41, 8, 14, 13, 22, 27, 43, 32, 14 
Find 
a) the median 


b) the interquartile range of these times. 


Pi Quartiles and the interquartile range 


2.3 Variance and standard deviation 


The mean of a set of data is given by the formula 


x 
Mean =x = when data is in a list, or 
n 


def 


=— when data is in a frequency table, 

2s 
where x is each data value, n is the number of data values and fis the 
frequency. 


Remember the greek letter sigma, be stands for ‘the sum of” 


So far you have met two measures of spread: the range and the interquartile range. 
Each of these measures uses only two values. 


Another measure of spread, one which uses all the data values, is the variance. 


‘The variance of a set of data is defined as the average of the squared distances 
from the mean. 


. x— xX) 
Variance = ry or 


The standard deviation is the square root of the variance. 


It is a measure of how spread out the set of data is, and it is in the same 
units as the data. 
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Example 3 
Find the mean and standard deviation of this set of data 
11, 13, 14, 16, 18. 


Sees eer 2, 

so the mean is ¥ = 2 = 14.4 

ee =1P 413+ a + 16° + 18° = 1066, 
so the variance is o = 14, 4° =5.84 


and the standard deviation is 


o= V5.84 = 2.42 (3 sf). 


(—) You can use your calculators statistical functions to get the mean and 
Bega standard deviation. Your calculator will normally have two versions, which 
give slightly different answers. 


‘The population standard deviation is the one that you want, and is 
usually shown as Gor ,, on a calculator. 


‘The other version is sometimes referred to as a sample standard 
deviation, and is usually shown as s or o,_,, but you don’t need to work 
with it at the moment. 


nV? 


Example 4 
The number of children, x, that each of a class of 30 students has in their family was recorded. 


Find the mean and standard deviation of the number of children in a family for this set of data, 
given that S ¢ = 66 and SS = 165. 


PPrrreeterr tree eterrerrrrrrrrirs Prrereer errr err erererrctreerrrrrrecrrerrerrer rit trrreer errr rrrererrrrrrr ey | 


Mean: x = £8 = 2,2 
30 


2.27 = 0.66 


Variance: 0? = in 


So standard deviation is o = ¥0.66 = 0.812 (3 s.f.). 


Example 5 


The number of putts, x, a golfer takes on each of the 18 holes in a round is recorded. 
Find the standard deviation, o, of the number of putts he takes on each hole, given that 


D(«-zY = 4.944, 


Sd eee eeeeeeeesenancecessneneeneeesesesnssaeseseseneeenseseeseneseaaessssseeesenssessessesesesessesencssssesend 


A944 
18) 


and the standard deviation is o = /0.2746... = 0.524 (3 s.f.). 


There were 18 holes, so the variance is 07 = = 0.2746.. 


Variance and standard deviation 


Example 6 


The heights of 142 plants are recorded in the table, where, for example, the class 65-70 
means at least 65cm and less than 70cm tall. 


Class 55-60 | 60-65 | 65-70 | 70-75 | 75-80 | 80-85 | 85-90 


Frequency 2 | 11 37 54 28 9 il 


Calculate estimates of the mean, x, and variance, o”, for the height of the plants. 


You need to calculate the mid-point of each class, and a table of values is the simplest way to do this: 


Class | Mid-point Frequency mf nef 
m 


55-60 57.5 115 6612.5 


60-65 62.5 687.5 42968.75 
65-70 67.5 2497.5 168581.3 
70-75 72.5 3915 283 837.5 
75-80 77.5 2170 168175 
80-85 82.5 742.5 61 256.25 
85-90 87.5 87.5 7656.25 


> fe 142 Yimf = 10215 Ym’ f =739087.5 


10215 _ 
i= 71.9 (to 3 s.f.) 


and the variance is o? = 220825 _ (ets 


: 
142 142 = B00 Cay). 


What does ‘standard deviation’ actually mean? 


A good visual idea of standard deviation is to imagine a histogram of the 
data. For a roughly symmetrical distribution, a spread of 4 standard 
deviations covers approximately the central 95% of the distribution. 

y 


2 
2 
5 
3 
> 
5 
3 
S 
2 
é 
x 
25 35 45 55 65 
<—_ 
4 standard deviations 
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Exercise 2.3 
1. For each set of data, calculate the mean and standard deviation. You should do 
this using the formulae, and check using your calculator. 


a) 12 17 11 8 6 18 14 17 
11 15 16 18 9 15 20 14 


[x] 4]5]|6 7/8 |9 


12 | 18 35 | 28 | 16 | 9 


2. The wingspans of 352 Great Tits were measured. The information is 
summarised in the table. 


Wingspan (cm) | 70 | 71 | 72 | 73 | 74 | 75 | 76 | 77 | 78 | 79 
Frequency | 4 | 19 | 53 | 77 | 23 | 75 | 53 | 30] 12] 6 


Calculate the mean and standard deviation of the wingspan of a Great Tit. 


3. The African Grey Parrot is an endangered species of small bird found in Africa. The weights 
(in grams, to the nearest gram) of 210 African Grey Parrots were measured during 2014. The 
information is summarised in the table below. 


Weight (grams) | 300-319 | 320-339 | 340-359 | 360-379 | 380-399 
Frequency | 3 | 45 149 u | 2 


Calculate estimates of the mean and standard deviation of the 
weight of an African Grey Parrot. 


4. ¥x=75, })x? =293, n = 30 

Find the mean and variance of X. 
5. Yix=2.1, ix? =83.1,n=7 

Find the mean and standard deviation of X. 
6. Si (x-*) =44.3, n= 16 

Find the variance of X. 


Variance and standard deviation 


7 ¥(x-¥) = 19 735.4, n= 37 
Find the standard deviation of X. 


8. Estimate the standard deviation of this distribution. 


o/ 


100 


Frequency density 


0 
20 30 40 50 60 70 80 90 a 


2.4 Which average should you use? 


4 


he simple answer is that if one measure was always the best then there would be no 
need for any others — so it depends! 


he mode is referred to as an average, but it isn’t the same sort of measure as the median 
and mean, and if you could only have one of these three, the mode is not the one 
you would choose. 


he median is the middle value (‘Mr Average’) - half of the other values are bigger 
and half are smaller — while the mean is the value which gives everyone an equal 
share. This suggests a strategy for deciding which measure to use: if the total of the 
values tells you something of interest, then the mean is likely to be the more 
appropriate. Note that it needs to be the fofal that tells you something of interest: 
if you were looking at the wages paid in a factory because you were thinking of 
applying for a job there, the median would be the right average to look at, since 

it would tell you what a typical worker is earning; on the other hand, if you wanted 
to buy the factory then the total wage bill would be of interest, since it tells you 
about running costs, and therefore the mean would be the more appropriate. 


So the most appropriate average is not simply a matter of whether the data are 
symmetrical, or whether there are outliers; using a set of data for different purposes 
may mean that different averages are appropriate. If the data are roughly symmetrical 
then the median and mean will be close together anyway, so it won't matter which 
you calculate - which tells you that symmetry is not a good criterion for choosing 
which to use. 


There is, however, one golden rule: the measures of centre and spread are not 
interchangeable! The median goes with the interquartile range and the mean goes 
with the standard deviation (or variance). 
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2.5 Coding 


The diagram shows two sets of data: 
Data set 1 represents the ages, in years, of a group of people; Data set 2 
represents the ages of the same people 15 years later. 


data set 1 * Age in 1999 
° © Age in 2014 
eeeee 
-evecopecsece—qeccessonere + 1 > 
30 40 50 60 70 go Age 
eeeee 
° 
data set 2 


All the data values have shifted by 15, so 
Mean of Data set 2 = (mean of Data set 1) + 15 
and there is no change to any measure of spread. 


‘The next diagram shows a set of temperatures in degrees Celsius (Data set 1) 
and the same temperatures in degrees Fahrenheit (Data set 2). 


data set 1 
~ 
eeeee 
-scceepecoeos + r poteeep eee ce om 
10 20 30 40 50 60 


data set 2 


To get the values in degrees Fahrenheit, all of the values in degrees Celsius have 
been multiplied by 1.8, then 32 added: 
Mean of Data set 2 = 1.8 x (mean of Data set 1) + 32 


The degree of spread is unaffected by the + 32. 
Standard deviation of Data set 2 = 1.8 x (standard deviation of Data set 1), 
so variance of Data set 2 = 1.8? x (variance of Data set 1). 


Ifa set of data values X is related to a set of values Y so that Y= aX + b, then: 
mean of Y= a x mean of X+ b 


standard deviation of Y = a x standard deviation of X 
variance of Y = a? x variance of X. 


You can use this idea to transform or code a set of numbers. 
You might do this to make the numbers easier to work with. 


Coding 


Example 7 

Here is the grouped data from Example 6, 

on the heights of 142 plants. (Recall that the 
class 65-70, for example, means at least 65cm. 
and less than 70cm tall.) 


Use the coding M = wees to find the mean and 


the variance of these data. 


Extend the table: 


Class 

55-60 
60-65 
65-70 
70-75 
75-80 
80-85 
85-90 


alulplwlrl|elo| = 


Df = 142 | } mf =410 | Ym’ f = 1354 


Then M= <2 = 2.887... and o%, = 12> — M? = 1.19857... = 1.20 (38.6) 
142 142 


To find the mean and variance of the uncoded data: 
m= M x5 + 57.5 = 2.887 x 5+ 57.5 =71.9 (3s.f.) 
o? =5?x a? = 25 x 1.19857... = 30.0 (3 s.f) 
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Exercise 2.4 
1. A set of observations {x} are coded using X =A 88. X =3.6, Var(X) = 5.3. 


Calculate the mean and variance of the original set of observations. 


2. A set of observations {x} are coded using X =~ =e X =8.2, Var(X) =3.22. 


Calculate the mean and variance of the original set of observations. 
3. The temperatures (in °F) at a resort are measured at the same time on 

eight successive Mondays: 

50.0 53.6 52.7 55.4 55.4 57.2 59.9 62.6 


a) Calculate the mean of these temperatures (in °F). 
F—32 
= 


b) Convert each temperature into °C using the formula C = 
5 


c) Calculate the mean of these temperatures (in °C). 
d) Check that the mean temperature in °F converts to the same mean in °C. 


4. The table shows information about the time taken by a bus to travel from 
Nikki's home to her grandmother's house. 


Time (minutes) | Frequency 
8<x<10 | 5 
10<x<12 | 14 
l2<x<l4 | 3 
l4cx<l6 | 2. «| 


a) Calculate estimates of the mean and variance of the times taken by the bus. 


x-9 
b) Code this data set using X = = 


2 
c) Find estimates of the mean and variance of the coded data. 


d) Check that the mean of x is 9 more than twice the mean of X, and the 
variance of x is four times the variance of X. 


Coding 


5. The table shows information about the salaries paid to a company’s 
employees. 
Salary Frequency 
$0 < x < $10000 12 
$10000 < x < $15000 63 
$15000 < x < $20000 32 
$20000 < x < $25000 21 
$25000 < x < $30000 8 
$30000 < x < $50000 4 
x —5000 
a) Code these data using X = 500 


b) Calculate estimates of the mean and variance of the coded data. 


c) Use your answers to part (b) to give estimates of the mean and 
variance of the salaries in the company. 


6. ‘The table shows information about the daily time spent by a nurse with 
each patient. 


lo<t< 15 312 
| 15 <t<20 479 
| 20<¢< 25 243 
| 25 <t<30 119 
a) Code these data using T = EBs, 


b) Calculate estimates of the mean and variance of the coded data. 


c) Use your answers to part (b) to give estimates of the mean and variance 
of the daily time spent by the nurse with each patient. 
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Summary exercise 2 


1. yx =150 by = 1535 n=25 EXAM-STYLE QUESTIONS 


Find the mean and variance of X. 6. Students’ marks on a history examination 


were: 73, 67, 76, 63, 59. 

a) Calculate the mean and standard 
deviation of these marks. 

For comparisons with other subjects, the 

marks are to be scaled so that they have a 

mean of 50 and a standard deviation of 10. 

This is to be done by using the 


EXAM-STYLE QUESTIONS 


2. Asummary of 20 observations of y 
gave the following information 


> (y-a) = -37 Yo-ay = 1529 
Find the mean and standard deviation of y. 


in _ X-a ‘ 
3. ‘The weights (in kg) of the hand transformation Y = > where X is the 


luggage carried on to a flight by 97 
passengers are summarised by 


DYix-5)=314 P(x-5)’ = 1623. 
Find yx and ae 
4. The test score, x, for a class of 14 students 


gives Dx = 919 and D2? = 60773. 


a) Calculate the mean and variance of the 
marks for this class. 


original mark and Y is the transformed mark. 
b) Calculate the values of a and b. 


7. A summary of 20 observations of x gave the 
following information: 


DY (x-a)=-23.2 and YP (x=-a)’=211.23 
The mean of these values of x is 8.95. 
i) Find the value of the constant a. 


ii) Find the standard deviation of these 


values of x. 
Another class of 15 students taking the same 


test scored a mean of 63.8, with a standard 
deviation of 5.58 marks. 
b) Calculate ¥x? for the second class. 


c) Calculate the mean mark for all of the 
students in the two classes. 


8. Arinda measured the resting pulse rates of 
some students. Her results are recorded below. 
68, 72, 67, 70, 74, 68, 72, 75, 67, 70 
Find the mean and standard deviation of 
these pulse rates. 


9. The amounts of money, x dollars, that 25 
people had in their pockets are summarised 
by })(x-40)=-23 and} (x—40)" = 347. 
Find 2 and pe 

10. The table shows the mean and standard 


deviation of the heights of boys and girls 
in a class. 


5. An airport bus service runs from the city 
centre and from the airport every 10 minutes 
during the day. The number of passengers 
ona random sample of journeys from the 
airport is shown below. 


11, 3, 14, 34, 1, 6, 12, 15, 8, 4, 28, 7, 3, 0, 8, 12 


Calculate the median and interquartile range 
of the number of passengers. 


Number | Mean Standard 
of pupils | (cm) | deviation (cm) 
Boys 15 172: 5.38 
Girls 12, 167.5 5.65 


ei Summary exercise 2 


i) Find the mean height of all 27 pupils in 11. A nurse records the pulse rates (in beats per 
the class. minute) of ten 4-year-old children and ten 


ii) The heights of individual boys are 14-year-old children. 


denoted by h, and the heights of 4-year-olds | 95 | 89 ]106] 92 114] 96 [107] 89 J115] 91 


individual girls by h,. By first finding 14 year-olds |72| 83 | 67| 75] 8087 | 68 | 76] 91 | 72 


Yh and YR find the standard 
deviation of the heights of all 27 pupils in Calculate the mean and standard deviation 


the class. of the pulse rates for each age group. 


Use your results to compare the distributions. 


Chapter summary 


Q is the median. 
To find the median, calculate tn. 


If 4 nis an integer r then Q, is the mid-point of x, and x... 
ifSn lies between rand r+ 1 then Q is x,,,. 


Q, is the lower quartile and Q, is the upper quartile. 
For a data set with n values (x,, x, ..., x,), to find Q,, calculate ¢n. 


If <n is an integer r, then Q, is the mid-point of x, and x... 


If tn lies between r and r + 1, then Q, is x,,. 
‘The same process is used to find Q., but using fn instead. 


‘The interquartile range (IQR) is the difference between the lower and upper quartiles: 
IQR= Q,-Q,. 


For a set of data, 


Mean = x = — or ae 


and 
Variance => @axy =o, or ae 
Sie ee ee oe 


For grouped data, the mid-point of the interval is used for x in each case. 


Measures of location and spread 


e The standard deviation is the square root of the variance and is a measure of how spread 
out the data are. It is measured in the same units as the data. 

e Ifaset of data values X is related to a set of values Y so that Y= aX + b, then: 
mean of Y=a x mean of X + b, standard deviation of Y = a x standard deviation of X, 
and variance of Y = a? x variance of X. 


kya Summary exercise 2 


Representing and analysing data 


June P August 


May 
Florence Nightingale worked as a nurse in the Crimean 


Sept war despite being born into a wealthy British family. She 
is famous for reforming hygiene in hospitals when she 


Zee 
| 7 ig discovered that more soldiers were dying from diseases 
7 aa in the hospitals than from their battle wounds. It was 


March 
her success in communicating these findings by using 


innovative statistical diagrams (coxcombs - the first 
Jan 1855 


Apr 1854 


examples of pie charts) which gave her discoveries the 
political capital to make a series of fundamental health and 
hygiene reforms. As a result, she was the first woman to 

Dec be elected to the Royal Statistical Society, and she was the 


first woman to receive the Order of Merit, which is Britain’s 


highest civilian honour. 


Objectives 
e Select a suitable way of presenting raw statistical data, and discuss advantages and/or 
disadvantages that particular representations may have. 


e Construct and interpret stem-and-leaf diagrams, box-and-whisker plots, histograms and 
cumulative frequency graphs. 


e@ Use acumulative frequency graph to estimate the median value, the quartiles and the 
interquartile range of a set of data. 


Before you start 


You should know how to: Skills check: 
1. Find the median and quartiles of a data 1. For the data sets given, find the median and 
set given as a list, a frequency table and quartiles. 
a grouped frequency table, e.g. Find the a) 8,8, 9, 10, 11, 12, 16, 18, 19, 22, 23 
median and quartiles of the following b) 
a Number of 
lata sets. 
a) 7,7,8,9, 10, 12,14 correct answers | 3) 4 |5 | 6 | 7 
Nk le in short test 
7, 7,8, 9, 10, 12, 14 - the median is the 
Frequency A |) 7' 16: [3 | 2 


middle value (9), the lower quartile (7) is 
the middle value of the bottom half, 

and the upper quartile (12) is the middle 
of the top half. 


b) | Number of 
L.| 2) || 3:4.) 5 
beans per pod 
Frequency 21/5|8|6|4 


There are2+5+8+6+4=25 values 
summarised in the table so the median is 
the 13th, which is a 3, the lower quartile 
is the 7th, which is a 2, and the upper 
quartile is the 19th, which is a 4. 


3.1 Stem-and-leaf diagrams 


Stem-and-leaf diagrams give the shape of a distribution in the same 
way as a histogram with equal intervals does, but also keep the detail 
available. 


‘The ordering in the stem-and-leaf diagram makes it easy to find the 
median and quartiles, as well as the highest and lowest values. 


The stem-and-leaf diagram below shows the heights of some plants: 


3/5 7 7 (3) 
aio? 2 8 *£ P © 
5/1 4 7 8 (4) 
6 | 2 (1) 


key 5 | 2 means52cm 


There are 13 plants, so the median is the 7th — shown in red (44cm). 


Construct a stem-and-leaf diagram in two stages: 
(1) First assign the values to the correct place on the stem. 


(2) Then order each ‘leaf’ in turn. 


The data in this diagram might have been provided exactly as it was 
collected: as a simple unordered list of data. 


42, 57, 37, 62, 51, 42, 47, 35, 54, 43, 44, 58, 37 


In this case, after the first stage the diagram will be: 


SF St 

4] 2 7 3 4 
5/7 1 4 8 

6 | 2 


Stem-and-leaf diagrams 


Then it is easy to order each row of data to get the final 
stem-and-leaf diagram. 


3 5 7 7 


key 5 J2 means 52cm 


In Chapter 2, you learned how to find the quartiles and interquartile range 
for a set of data — here the lower quartile (42 cm) and upper quartile (54cm) 
are highlighted in blue, giving an interquartile range of 12cm. 


Two sets of related data can be shown as a back-to-back stem-and-leaf diagram, 


where a common stem is used and the two sets of data go to the left and right. 


Back-to-back stem-and-leaf diagrams allow comparison of the 
distributions to be made. 


You should make a comment about the location and spread of the two, but 
try to describe them in context rather than just list values of medians and 
interquartile range, e.g. if another type of plant had heights which were all in 
the 50-70cm range you could say that the first type tended to be smaller and 
were more variable in height than the second type of plant. 


Example 1 


‘The heights of samples of two types of plants are represented in this back-to-back stem-and-leaf 
diagram. Compare the heights of the two types of plants. 
TypeA TypeB 
(4) 

(2) (6) 

(6) 6 (5) 

(5) 4 (2) 

(3) 5 eb || 4 

Key 1 | 4 | 5 means 41 cm for type A and 45cm for type B 


The median of type A is 50.5 cm and of type B it is 38. The IQR for A is 12 and for B it is 13. 


Plants of type A are taller on average than When comparing distributions, you should 


plants of type B, and the two types of plant. __ comment on average and spread. The comments 
have similar variability in their heights. need to be in the context of the data. 


Representing and analysing data 


The number of minor mistakes on driving tests marked by an examiner 
(A) are listed below. 


12| 8] 11] 14] 18] 14] 15] 11 
9 |7] 12] 18] 11] 17] 16] 10 


Using two stems with leaves 0-4 and 5-9 and assigning each data 
value to a stem. 

0/8 9 7 

1/2 1441 2 1 0 

1/8 5 8 7 6 


Then ordering the leaves gives the final stem-and-leaf diagram. 


0/7 8 9 
1;0 1 1 1 2 2 4 4 
1/5 6 7 8 8 


Key: 1 | 6 means 16 minor mistakes 


‘This stem-and-leaf diagram shows the number of minor mistakes made on 
driving tests marked by another examiner, B. 


oli 4 (2) 
0/5 77 (3) 
1]/0 0123 3 4 4 (8) 
1/6 788 (4) 
2]}0 1 (2) 
Key: 1 | 6 means 16 minor mistakes 
Examiner A Examiner B 
o}1 4 (2) 
(3) 9 8 7/0 {5 7 7 (3) 
(8) 4 4 2 1 o/1j]0 0 1 2 3 3 4 4 (8) 
(5) 8 7 6 5/1/6 7 8 8 (4) 
2/0 1 (2) 


Key: 2 | 1 | 5 means 15 minor mistakes for Examiner B 
and 12 minor mistakes for Examiner A 


The two examiners seem to identify approximately 
the same number of minor mistakes on average, 
however Examiner B has a much wider range. 


kic@m ©Stem-and-leaf diagrams 


Exercise 3.1 


1. The data below records the masses, in grams, of samples of 35 plums of two different types. 


Type A Type B 
qa) 2|5 qa) 4/4 
3) 3 {024 (ll) 4 |56778889999 
(5) 3|55779 (14) 5 |00001111122334 
(9) 4 |122224444 (9) 5 |556677899 
oo Key: 4 | 5 means 45 grams for each diagram 
(3) 5 |789 


a) For each type, find the median and interquartile range of the masses. 

b) Draw a back-to-back stem-and-leaf diagram to show these data in a single diagram. 
c) Compare the characteristics of the two types of plums. 

d) Ifyou were a fruit grower, which of the two types would you plant? Give a reason. 


2. A group of students measured their pulse rates at the start of a statistics lesson. They then did five 
minutes of moderate exercise and measured their pulse rates again. The results are summarised below: 


Before exercise After exercise 
(3) 9 8 6 | 6 
(4) 7 & & O97 
(4) 8 6 & 1 8 | 4 5 7 T.8 (5) 
(3) 3.3 2 9 1 3 4 4 6 9 (6) 
(1) 0) 10 2 5 5 (4) 


Key: 1 | 9 | 5 means 91 beats per minute before exercise 
and 95 beats per minute after exercise 


Compare the pulse rates of the group before and after exercise. 


3. Samples of the weights of Dunnocks, a small brown bird, are taken in January (during 
the winter) and April (when their breeding season starts). 
January April 

18 | 6 (1) 
(1) 5 | 19] 14578 (5) 
(4) 9621 | 20] 0244589 (7) 
(7) 9885520 | 21 | 0125556789 (10) 
(8) 76533322 | 22] 00233489 (8) 
(8) 99944210 | 23] 4 (1) 
(5) 43210 | 24] 57 (2) 
(1) 2 | 25 
(2) 20 | 26 


Key: 7 | 22 | 5 means 22.7 grams in January and 
22.5 grams in April 


Compare the weights of Dunnocks in January and in April. 
Representing and analysing data [iiRyé 
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4. The wingspans of a sample of male and female Eurasian Eagle-Owls were collected, 


and are summarised in the diagram below. 


Male Female 
(2) 97 |15 
(5) 86632 |16] 8 9 
(6) 766531 /}17| 0 3 
(1) 3 |is| 25 
19 0 3 


Key: 1 | 17 5 means 171 cm for males and 175cm 
for females 


Compare the wingspans of male and female Eurasian Eagle-Owls. 


(2) 
(5) 
(5) 
(2) 


5. The number of grams of carbohydrates in a typical portion of some different 
fruits and vegetables are given. Construct a back-to-back stem-and-leaf diagram 
and compare the amount of carbohydrates in fruit and vegetables. 


Fruit 
34 3 30 12. 23 12 20 19 
15 26 13 19 IY 26 21 


Vegetables 
4 6 8 7 5 4 2 5 
2 2 3 Wb 26 18 5& 


3.2 Box-and-whisker plots 


Box-and-whisker plots (sometimes referred to as boxplots) 
typically show only five ‘points’ from the distribution — the top 
and bottom of the range, the median and the upper and lower 
quartiles (UQ and LQ). 


‘These five values are sometimes known as the quartiles and the 
notation Q,, Q,, Q,, Q, and Q, is sometimes used to refer to 
minimum, LQ, median, UQ and maximum respectively. 


The pair of box-and-whisker plots on the right show the heights of 
two groups of plants. We see that Group 1 are much more consistent 
in their heights than Group 2, and Group 2 plants tend to be taller. 


Box-and-whisker plots 


Heights 


75 


65 


55 


Heights of two groups of plants 


Groups 


The decathlon is a set of 10 running, jumping and throwing events in 
which athletes are awarded points in each event based on their 
performance. The total points across the 10 events determines the 

result of the decathlon competition. The three box-and-whisker plots below 
show the scores across the 10 events for the two best decathetes ever 

(up to June 2014), who scored 9039 and 9026 points, and the personal 

best of the 50th best decathlete (8506 points). 


The strengths and weaknesses of the box-and-whisker plot are vividly illustrated 

by this example. The ‘average’ for the two best decathletes is very similar - the middle of 
the box — but Roman Sebrle’s performances across the 10 events were much 

more consistent that Ashton Eaton's. Valter Kiilvet’s performances are even 

more consistent but at a much lower level than the other two. The weakness is 

that you have no detail beyond the five summary statistics - and in this case 

that means you cannot make any comparison of performances in individual events. 


Scores on the 10 events in the decathlon 


Valter Kiilvet (8503 points) 
Soin ~ ‘eal 


Roman Sebrle (9026 points) = ict 
- Second best decathlete ever 


Ashton Eaton (9039 points) __, i —| 
- Decathlon world record 


0 200 400 600 800 1000 1200 


Outliers 


Outliers are values that are uncommonly large or small for the data. 
One common definition of an outlier is a value which is more than 
1.5 x IQR above the UQ or below the LQ - these limits are known as fences. 


x-Q,>15x(Q,-Q) 
Q,-x>15x(Q,-Q) 


We can see that there are no outliers for any of the three decathlons shown 

above — none of the whiskers extend far enough to be 1.5 times the width of the 
box. Question 2 below represents outliers by *s beyond the main whisker — allowing 
you to see the bulk of the distribution and any unusually large or small values 
individually. Because of the amount of work involved in calculating what is an 
outlier and in plotting, outliers are not part of this course, but computers can be 
programmed to plot them automatically, and you should be familiar with the ideas. 


Representing and analysing data 


Temp. 


other information you might wish to know. 7 


Exercise 3.2 S 
1. These box-and-whisker plots show the average daily 31 
temperatures in two holiday resorts in July. Compare the 30 
two resorts as holiday destinations, and comment on - = 

: 7 


A B 
Resort 


2. The two box-and-whisker plots below relate to the salaries in a small college in the USA. 


a) ‘The first shows the salaries paid during the 1991-1992 academic year, separately 
for males and females. Is there any difference in the treatment of men and 
women in this college? 


Boon 25000 
50000 

45000 

40000 ° 

35000 § 15000 
30000 i 

25000 Hy 
20000 


1991 $ 


0 


Gender Gender 


b 


‘The second shows the staff members initial salary, in whatever year 
they joined the staff, again shown separately for males and females. 
Does the information contained in this diagram alter any views you 
formed from the first data set? If so, explain how. 


3. The quartiles of the number of employees in random samples of firms in 2010 
and in 2015 are given in the table below, together with any outliers which are more 
than 1.5 x the interquartile range beyond the quartile. 


Min | LQ | Median UQ Max Outliers 
2010 uy 6 18 42 145 106, 131, 145 
2015 1 SI 13 55! 160 95, 160 


a) Construct a box-and-whisker plot for each year. 


b) Compare and comment on the two distributions. 


et ~Box-and-whisker plots 


4. The table shows the serving sizes the US Food and Drug Administration use for 
15 foods in each of the nutrition factsheets they issue for fruit, vegetables and 


sea food. 

Fruit | 242 30 | 126} 134] 126 | 134] 148 | 154} 147 | 166 | 112 | 161 | 147 | 140} 280 
Vegetables | 93 | 148/148] 78 | 99 | 110} 99 | 84 | 89 | 85 | 84 | 148] 148] 90 | 148 
Seafood 84 | 84 | 84 | 84 | 84 | 84 | 84 84] 84 | 84 | 84 | 84 | 84 | 84 | 84 


a) Find the quartiles for each of the three types of food. 


b) Why would you not draw a boxplot for the serving sizes of seafood? 


c) Using a single axis, construct box-and-whisker plots for the serving 
sizes of fruit and vegetables in the table. 


5. The diagram shows the scores registered for the 100 metres, the long jump 
and the 1500 metres in the personal bests for the top 75 decathletes up to 
June 2014. Compare the performances in the 3 events. 


100 metres —> io 


Logiune > |i 


1500 metres —> ||| } +] 


0 200 400 600 800 1000 1200 
3.3 Histograms 


Ina histogram, the area of each bar is proportional to the frequency in 
the interval. 


When the intervals are not of equal width the height of the bar is the 
frequency density and it must be scaled. 


frequency 


e Frequency density = aefeamlviaahe 
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Example 2 


The heights of the children in a school are measured correct to the nearest centimetre and are 
summarised in the table: 


Height (cm) | 120-129| 130-139] 140-144] 145-149} 150-154| 155-159} 160-169} 170-179 
Frequency 60 80 50 93 77 67 72 54 


Draw a histogram to represent this data. 

‘The interval end-points are 

119.5, 129.5, 135.5, 139.5, 144.5, 149.5, 154.5, 
159.5, 169.5 and 179.5 

so the widths of the intervals are 10, 10, 5,5, 5, 5, 
10 and 10. 

‘The heights of the bars will then be 


GN OL BU A 5 aya ah aye oa 54 _ 
7G =6; 10 8; 5 10; 5 18.6; 5 15.4; 5 13.45 10 7.2 and 10 5.4 
and the histogram looks like this: 


12 


Frequency density 


< The horizontal axis should be drawn as a 


390 20)" 180) 150" 150/180) 270) 480 linear scale showing the variable. 


Height (cm) 


Relative frequency histograms 


In Chapter 9 you will meet the normal distribution, which 


is an example of a probability distribution. > 0.03 
In a relative frequency histogram, the total area is defined 5 
to be 1 square unit, and then the area of the histogram 2 0.02 
between any two values will represent the proportion of E 
data which lies between those two values. £ 001 
3 0. 
To calculate the heights of the bars for a relative frequency @ 
histogram, you can divide the frequency densities by the ooo 


total frequency — this will automatically scale the total areatobe 1. 110 120 130 140 150 160 170 180 
The earlier example looks like the distribution on the right. Height (cm) 


Histograms 


Example 3 


The ages of members of a cricket society are summarised in the table below. 


Age (years) 25-34 35-44 45-49 50-54 55-59 
Frequency 6 9 13 9 4 


a) Draw a relative frequency histogram to display this information. 


b) Calculate an estimate of the number of members aged between 40 and 52, inclusive. 


Since these are ages, the upper and lower class boundaries (UCB, LCB) of the first interval are at 
25 and 35 etc. A table is the best way to show the calculations. 


Ages | Frequency | Interval width | LCB | UCB | Frequency | Relative frequency 
density density 
25-34 10 25 35 0.6 0.012 

35-44 10 35 45 0.9 0.018 

45-49 5 45 50 2.6 0.052 

50-54 J 50 55 1.8 0.036 

55-59 i} 55, 60 1.4 0.028 

60-74 15 60 0.4 0.008 

Total 50 


To estimate the number of members between 
40 and 52, include half of the 35-44 interval, 
all of 45-49 and 2 of the 50-54 interval. 
Taking the same proportions of the numbers 
in the intervals gives 

0.5 x9 + 13 + 0.6 x 9 = 22.9 as the estimate. 


Relative frequency density 


25 30 35 40 45 50 55 60 65 70 75 
Ages 


Exercise 3.3 

1. Drawa histogram of the following data: 
Length (cm) 20-50 | 50-60 | 60-65 | 65-70 | 70-80 | 80-90 | 90-110 
Number of plants 27 18 16 15 22 | 14 14 


2. The table gives the daily protein intake of a sample of 130 people in a country. 


Protein (grams) 0-15 —25 | -30 | -35 | -40 | -45 | -50 | -55 | -65 
Number of people 18 14 11 16 21 18 15 11 6 


Representing and analysing data 


a) Drawa histogram for these data. 

b) Calculate estimates of the mean and standard deviation of the daily protein 
intake of the people in the country. 

c) Corresponding data were collected in another country. The mean of that 
set was 45.1 grams and the standard deviation was 5.9 grams. 
What does this tell you about the diets of the two countries? 


3. A question on a survey asked the students how long their journeys to school had 
taken on a particular day. A summary of the results is shown below. 


Time Number of children 
(minutes, correct to the 
nearest minute) 


1-15 115 
16-25 46 
26-35 36 
36-55 22 
56-80 14 


Illustrate these data using an appropriate diagram. 


4. The table shows information about the salaries paid to employees in a company. 


Salary Frequency 
$0<x<$10000 Z 
$10000<x<$15000 82 
$15000 <x<$20000 45 
$20 000 <x<$25 000 24 
$25 000 <x<$30000 13 
$30000 <x<$50000 4 


Draw a histogram to represent these data. 


5. The length of time students take to complete 


Time, t Number of pupils 
a mathematical puzzle is summarised in the table. (minutes) 
a) Drawa relative frequency histogram to 0<t<2 6 
represent these data. 2<ts5 8 
b) Calculate an estimate of the number of students 5<ts8 15 
who took more than 10 minutes, but not more 8<ts12 14 
than 15 minutes, to complete the puzzle. 12<t<20 7 


gee Histograms 


3.4 Cumulative frequency graphs 


When data is given in a grouped frequency 
table, you can use interpolation to estimate 
the value of quartiles. 


Example 4 
A question on a survey asks students how long their journey to school took on 
a particular day. A summary of the results is shown below. 


Time Number of students 
(minutes, correct to the 
nearest minute) 
1-15 
16-25 46 
26-35 36 
36-55 22 
56-80 14 


Calculate estimates of a) the median and b) the upper quartile for this data. 


Rewrite the table with cumulative frequenci 


Time Number of students UCB | Cumulative frequency 
(minutes, correct to the 
nearest minute) 

1-15 0.5 155 
16-25 46 15.5 25.5 
26-35 36 25:5 35.5 
36-55 22 35.5 55.5 
56-80 14 55.5 80.5 


a) n= 233, and the median is the 117th value; 117 is between 115 and 161, 
so the median lies in the 16-25 interval. 
117 - 115 = 2 and there are 46 students in the interval, so the estimate 


will be 2. of the way through the interval. 


The interval starts at 15.5 and is of width 10 minutes so the 
median 


1S Sef = * 10 = 15.9347... = 15.9 minutes. 


> Continued on the next page 


Representing and analysing data 


b) n=233, and the upper quartile is the 175th value (233 x = = 174.75); 
175 is between 161 and 197, so the upper quartile lies in the 26-35 interval. 
175-161 = 14 and there are 36 students in the interval, so the 
estimate will be # of the way through the interval. 
The interval starts at 25.5 and is of width 10 minutes so the 
upper quartile is 


25.5 + u x 10 = 29.3888... = 29.4 minutes. 


You may prefer to use a formula for interpolation: 
e Estimated value of quartile = 


Q-cum. freq. at start of interval 


lower class boundary + ke interval width 


interval frequency 
where Q is the position of the quartile required. 
An alternative approach is to plot the cumulative frequencies on 


a graph, using the interval end-points as the x-coordinate and the 
cumulative frequency as the y-coordinate. 


If you then join these points with straight lines, or by drawing the smoothest 
curve through them that you can, you can use the graph to estimate the 
times for any number of students you are interested in, including the quartiles. 


from the LCB for 
the first interval 


‘The points to be plotted are (0.5, 0); (15.5, 115); (25.5, 161); (35.5, 197); 
(55.5, 219); (80.5, 233) - remember to look carefully at what the interval 
end-points will be, it is normally easy to get them right provided you do 
think specifically about how the data was recorded and any rounding 
that took place. 


Straight-line graph 


Cumulative frequency 


10 20 30 40 50 60 70 80 
Time (minutes) 


et Cumulative frequency graphs 


250 


nN 
8 
s 


Cumulative frequency 


Best smooth curve 


' 


10 20 30 40 50 60 70 80 
Time (minutes) 


You can see that, for this set of data, the two graphs are not very different. 
Either approach is acceptable for this examination so choose one, stick 


with it, and draw it clearly. 


e The 


an interval equally, and changes the rate of oc 
each interval. 


straight line graph is effectively spacing all the data values within 


currence at the end of 


e The interval end-points are often arbitrary cutoffs and there is not actually 


any change in the rate of occurrence there - t 


and 
e The 


before and after influence what we think will 


occur continuously. 


he changes are more gradual 


‘best smooth curve is trying to produce the best fit - it lets the intervals 


happen in an interval - for 


example if there is 40 in an interval and then 75 in the one above but only 6 
in the one below, we would expect more of the 40 to occur towards the top 


end 


of that middle interval. 


Exercise 3.4 


1. A group of students recorded their pulse rates when they were in a 
relaxed restful state. 


53 | 54 | 55 | 55 | 57 58 | 59 | 59 | 59 | 62 | 62 | 62 

62 | 62 | 63 | 63 | 63 | 64 | 64 | 65 | 65 | 67 | 67 | 67 

68 | 68 | 69 | 69 | 70 | 71 | 71 | 73 | 75 | 79 | 80 | 83 
a) Calculate the median and the quartiles of these pulse rates. 


b) 


°) 


Construct a grouped frequency table for the students’ pulses, 
using intervals 51-55; 56-60; 61-65; 66-70; 71-75; 


76-80; 81-85. 


Calculate estimates of the median and the quartiles of these 
pulse rates using the table in part (b) using interpolation. 


Representing and analysing data 


2. A group of 45 women go on a skiing holiday. 


Their ages are listed below. 


17 | 17 | 17 | 19 | 19 | 19] 20'| 22 | 22 | 22'| 23] 23 | 24 | 25] 25 


25 | 26 | 26 | 26 | 26 | 27 | 27 | 28 | 28 | 28 | 29 | 29 | 29 | 30 | 32 


32 | 32 | 34 | 34 | 35 | 35 | 35 | 35 | 36 | 37 | 38 | 38 | 44 | 45 | 48 


a) Calculate the median and quartiles of their ages. 

b) Construct a grouped frequency table for the women’s ages, using intervals 
17-20; 21-24; 25-28; 29-32; 33-48. 

c) Draw a cumulative frequency graph for the data table in part (b). 


d) Use your cumulative frequency graph to estimate the median and quartiles 
of their ages. 


3. The table shows information about the salaries paid to employees in a company. 


Salary Frequency 
$0<x<$10000 24 
$10000<x<$15000 127 
$15000<x<$20000 45 
$20 000 <x<$25 000 24 
$25 000 <x<$30000 13 
$30000 <x<$50000 4 


a) Drawa cumulative frequency graph for the salaries data. 

b) Use your graph to estimate the median and quartiles. 

c) Estimate the proportion of the employees who earn more than twice 
the median salary. 


4. ‘The length of time pupils take to complete a mathematical puzzle is summarised 


in the table. 
Time, t 
. Number of il: 
Geniistes) jumber of pupils 
Ost<2 
2st<5 
5St<8 1S. 
8<t<12 14 
12<t<20 ih 
a) Drawa cumulative frequency graph to represent these data. 


Calculate estimates of 
b) the median 
c) the interquartile range of the times taken. 


ett =Cumulative frequency graphs 


3.5 Skewness 


Skewness is the term used to describe the lack of symmetry in a distribution. 


Histograms and box-and-whisker plots can show skewness quite well, particularly where data is not 
symmetric about the median. 


A positively skewed distribution has: A 


e A long tail to the right H —— 
© Q-Q>Q-Q 


A negatively skewed distribution has: A 
e A long tail to the left 


© Q-2<Q-Q | t 1 


A symmetric distribution has: 
e Equal length tails 


* @-0.-0,-Q, 


There are some common measures of skewness: 


‘These measures quantify skewness as well as giving the sign 
(positive or negative). 


3x(mean - median) and —™ean= mode 


standard deviation standard deviation 


The quartile skewness coefficient is defined by 


(Q-@)-(Q-Q) _ Q-20+Q 
Q,-Q Q-Q ~ 


All these measures allow comparisons to be made between different distributions. 


Representing and analysing data 
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Example 5 

Here are the summary statistics for some data: 
Mean = 10.7, 

standard deviation = 9.3, ane 
lower quartile = 3.6, 
median = 8.6, 

upper quartile = 15.4 and 


mode = 2.7. 
The boxplot is shown on the right. 
a) Describe the skewness of this distribution, giving two reasons. 


ences dit 2 
b) Calculate the skewness coefficient sek and state what it means. 
stan ie" mn 


a) There is a positive skew since 


i) the boxplot shows a long tail to the right 
ii) mode < median < mean. 
3x(mean—median) _3x(10.7-8.6) 6.3 
standard deviation 9.3 oh) 
‘The positive value indicates that the skew is positive. 


= 0.68 


Outliers 


Outliers are extreme values, and they are common in skewed distributions. 
An outlier is often regarded as being: 


e Any value which is more than 1.5 x IQR above the upper quartile or 
below the lower quartile: 


x-Q, >1.5x(Q,-Q) 
Q -x>1.5x(Q, -Q) 
Or... 


e Any value which is more than 2 standard deviations above or below the mean: a 
o 


Skewness 


Exercise 3.5 


1. Summary statistics are given for three groups in the table below. 


Group A Group B Group C 
Mean 35.7 89.2 54.0 
Standard deviation 6.4 11.3 7.2 
Median 32.1 95.3 56.5 
Lower quartile 28.3 81.6 47.2 
Upper quartile 41.2 102.5 62.7 
Mode 29.1 96.1 55.8 


a) Calculate the coefficient of skewness 


3 x(mean - median) 


standard deviation 


for each of the groups. 


b) Which of the three distributions is most skewed by this measure? 
2. Summary statistics are given for three groups in the table below. 


Group A Group B Group C 
Mean 84.1 79.3 56.1 
Standard deviation 6.2 17.2 8.3 
Median 85.1 84.2 52.1 
Lower quartile 79.3 71.1 41.0 
Upper quartile 90.1 102.7 59.3 
Mode 86.2 85.3 53.1 


a) Calculate the coefficient of skewness Q 


2Q, 
Q-Q 


+Q for each of the groups. 


b) Which of the three distributions is most skewed by this measure? 


c) Does the coefficient of skewness 
of skewness for these groups? 


standard deviation 


mean — mode 


give the same order 


3. The ages of applicants for mortgages are recorded by an estate agency. 
The results are shown below. 


25, 29, 27, 32, 45, 34, 26, 28, 30, 42, 26, 51, 29, 27, 33, 27 


a) Calculate the mean of these data. 


b) Draw a stem-and-leaf diagram to represent these data. 
c) Find the median and the quartiles of these data. 

An outlier is an observation that falls either 1.5 x (interquartile range) 

above the upper quartile or 1.5 x (interquartile range) below the lower quartile. 
d) Determine whether or not any items of data are outliers. 


e) On graph paper draw a boxplot to represent these data. Show your scale clearly. 


f) Comment on the skewness of the distribution of ages of applicants for mortgages. 


Justify your answer. 


Representing and analysing data 


4. Ata metro station, a regular passenger times (in seconds) how long he 
has to wait for a train to arrive once he gets to the platform. These data 
are listed below. 
87 42 0 62 124 0 58 37 74 94 
182 23 17. 62 «29 17. 82 #54 0 45 
a) Find the median and interquartile range of the waiting times. 
An outlier is an observation that falls either 1.5 x (interquartile range) 
above the upper quartile or 1.5 x (interquartile range) below the lower quartile. 
b) Draw a boxplot to represent these data, clearly indicating any outliers. 
c) Comment on the skewness of these data. Justify your answer. 


d) Explain how a zero waiting time occurs. 
3.6 Comparing distributions 


There are various measures of average that you can use to compare distributions, 
and you are not expected to calculate all of them. 


The mean uses all data values, The median is the middle value, The modal class has the highest 
but can be distorted by outliers. and is less influenced by outliers. frequency density - in a histogram, 

1 1 it is the interval with the tallest block. 
—00000-O——$$——___———o. —00000-0-——o 


Similarly there are various measures of spread as well. 


The standard deviation is used The interquartile range is used The range is the difference between 
with the mean. It can therefore be with the median. It concerns the the highest and lowest values, so it is 
distorted by outliers. The diagram middle 50%, and is unaffected by badly affected by extreme values. 
shows one standard deviation outliers. 

either side of the mean. kK ~-—————___—>| 


—c0000-0 ———o- 
So — rd 


If you are comparing distributions, 


oO make the comparison in context 


oO make reference to both the average values and the spread. 


It is easier to make comparisons between two distributions than it is to describe one 
set of data from a graph. Histograms, box-and-whisker and stem-and-leaf diagrams all show 
average values, spread and whether the distribution is reasonably symmetric. 
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he following data are marks on a test out of 30 for two classes, A and B. 


By 10 20 30 

On average, class B have done better than class A. There is a much greater 
spread of marks in class B than in class A. 

The other striking feature here is that the top half of the marks in class B are 
very spread out in comparison with the bottom half of B and both halves of A. 


Example 6 

‘The times taken by pupils from three schools (A, B, C) to complete a mathematical 
challenge are summarised in the box-and-whisker plots below. Compare the performance 
of the pupils in the three schools. 


5 


Pupils in schools A and B generally did better than school C - they 
completed the puzzle quicker and there was less variability in the times 
they took. A had the quickest pupils but also had pupils slower than 
any of B’s. 


In cumulative frequency (CF) graphs, the shape of the distribution is much less clear, 
but it is worth remembering that the gradient of the CF graph represents the 

rate at which observations occur — which is the height of the histogram at that 

point in the distribution. 
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In comparative bar charts, make sure you talk about comparisons between each 
of the variables — so if you have data on boys and girls of different ages, talk 
about both - even if there is no difference across one variable. 

Proportion of pupils who own a mobile phone 
80% 


Di Boys 
60% 4 Oi Girls 


SICA RUIBUIBIILIRLBLLBLIE 
1 2 3 4 5 6 7 8 9 
Year group 
‘The proportion of boys and girls owning mobile phones is very similar, but older 
pupils are more likely to own a mobile phone. 


Exercise 3.6 


1. A group of students took their pulse rates at the end of a mathematics lesson. 
The mean was 64.2 and the standard deviation was 6.5. 


At the end of the next lesson, which was PE, they took their pulse rates again. 
‘The mean was now 71.6 and the standard deviation was 9.1. 
Compare the pulse rates before and after the PE lesson. 


2. A new diet is claimed to increase the speed of weight loss. A random sample 
of 12 volunteers followed the diet and recorded their weight loss in one week. 
‘The results were (in kilograms): 


13 11 O05 13 15 L4 08 12. 10 O08 12 11 


a) Calculate the mean and variance of the weight loss with this diet. 
A traditional diet has a weight loss with a mean of 0.8kg and variance 0.1 kg”. 
b) Compare the weight loss from the two diets. 


3. The box-and-whisker plots show the lengths (in cm) of a certain type of 
plant found in two gardens. Compare the plants in gardens A and B. 


kZa Comparing distributions 


Summary exercise 3 


EXAM-STYLE QUESTIONS 


1. 


Air traffic control at an airport record the 
delays in arrival times of flights into the 
airport. On one day, 40% of all flights had no 
delay, the longest delay was 44 minutes and 
half of all flights had delays of no more than 
4 minutes. A quarter of all delays were at 
least 18 minutes, but only one was more than 
25 minutes. 
An outlier is an observation that falls either 
1.5 x (interquartile range) above the upper 
quartile or 1.5 x (interquartile range) below 
the lower quartile. 
a) On graph paper, draw a boxplot to 
represent these data. 
b) Comment on the distribution of delays. 
Justify your answer. 
The box-and-whisker plot below summarises 
the delays at the same airport on another 
day. 


x 


—+ 7 
O 5 10 15 20 25 30 35 40 45 51 


c) Compare the delays to flights arriving at 
the airport on the two days. 


An airport bus service runs from the city 
centre, and from the airport, every 10 
minutes during the day. The number of 
passengers on a random sample of journeys 
from the airport is shown below. 

11,, 3,, 14, 34, 1,6, 12, 15, 8, 4, 28; 7, 
3; 0, 8, 12. 

Draw a stem-and-leaf diagram to represent 
these data. 


3. 


A regular airline passenger keeps a record 
of the amount of time which passes after the 
plane lands before she has collected her bags 
and exited the terminal. The times, to the 
nearest minute, for all the journeys she has 
made in the past six months are summarised 
in the table below. 


Time to exit terminal | Number of flights 
5-6 12 
7-9 15 
10-14 12 
15-19 8 
20-29 11 
30-49 14 


a) Give a reason to support the use of a 
histogram to represent these data. 

b) Write down the upper class boundary 
and the lower class boundary of the class 
10-14. 

c) On graph paper, draw a histogram to 
represent these data. 

d) Calculate estimates of the mean and 
standard deviation of the time to exit the 
terminal. 

On international flights the passenger 

had to go through passport control and 

immigration as well as collecting baggage 

before exiting the terminal. The shortest 

time she took to exit on any of the 12 

international flights was 25 minutes. 

e) State, with reasons, what effect excluding 
the international flights may have on 
the size of: 


i) the mean 


ii) the standard deviation. 


Representing and analysing data 
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4. A group of 79 students want to enter a problem-solving competition. The times they take to solve 
a qualifying question are summarised in the table below. 


Time (to nearest second) 15-20 | 21-30 | 31-35 | 36-45 | 46-60 
Frequency 18 24 10 12 15 


i) Drawa histogram on graph paper to represent this information. 
ii) Calculate an estimate of the mean time taken to solve this problem. 
iii) State which class contains the lower quartile and which class contains the upper quartile. 


Hence find the least possible value of the interquartile range. 


5. The following back-to-back stem-and-leaf diagram shows the times to load a program on 
25 desktop computers and 31 tablets. 


Desktop computer | | Tablet 
(4) 9 8 7 S 2 
(7) 9 7643 2 1 3 7 8 (2) 
(6) 5:8 5 3 Lf tL 4 02 5 8 9 (5) 
(5) 8 6 3 1 1 §. 12325 6 7 8 9g (8) 
(3) 4 3 1 6 122 45 67 8 9 (9) 
vA 01 12 4 5 8 (7) 


Key: 8|4|5 means 0.48 seconds for a desktop computer and 0.45 seconds for a tablet 


i) Find the median and quartiles for desktop computers. 
You are given that the median, lower quartile and upper quartile for tablets are 0.61 seconds, 
0.51 seconds and 0.69 seconds respectively. 

ii) Represent the data by drawing a pair of box-and-whisker plots in a single diagram on 
graph paper. 

iii) Compare the loading times for these two types of computer. 


6. The following table has some statistical data for 18 countries relating to 2012, 


published by UNICEF. 

UNICEF Infant (under 1) Total GNI per capita|Life expectancy] Total adult 

statistics mortality population | (US$) 2012 | at birth (years) |literacy rate (%) 
rate (deaths | (thousands) 2012 2008-2012 
per thousand 2012 
births), 2012 

Bangladesh 33 154695.4 840 70.3 57.7 

Brunei 

Darussalam x 412.2 = 78.4 95.4 

China 12 1377064.9 5740 75.2 95.1 
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UNICEF Infant (under 1) Total GNI per capita|Life expectancy| Total adult 
statistics mortality population | (US$) 2012 |at birth (years) |literacy rate (%) 
rate (deaths | (thousands) 2012 2008-2012 
per thousand 2012 
births), 2012 
Egypt 18 80721.9 3000 70.9 73.9 
India 44 1236686.7 1530 66.2 62.8 
Indonesia 26 246864.2 3420 70.6 92.8 
Malaysia 7 29239.9 9800 74.8 93.1 
Mauritius B 1239.6 8570 73.5 88.8 
Nepal 34 274744 700 68 574 
New Zealand 5 4459.9 30620 81 - 
Pakistan 69 179160.1 1260 66.4 54.9 
Saudi Arabia 7 28287.9 18030 75.3 87.2 
South Africa 33 52385.9 7610 563 93 
drinidadand 18 1337.4 14400 69.8 98.8 
Tobago 
United: Arab 7 9205.7 36040 767 90 
Emirates 
United 4 62783.1 38250 80.4 = 
Kingdom 
United States 6 317505.3 50120 78.8 = 
Zimbabwe 56 13724.3 680 58.1 83.6 


€ 
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a) Construct a stem-and-leaf diagram for the infant mortality rates for these countries. 


b) Construct a stem-and-leaf diagram for the total adult literacy rates (rounded to the 
nearest percent) for these countries. 


c) Construct a box-and-whisker plot for the 
Describe the skewness of the data. 


ife expectancy at birth for these countries. 


d) Construct a box-and-whisker plot for the GNI (Gross National Income) per capita 
for these countries. Describe the skewness of the data. 


e) Construct a box-and-whisker plot for the total population of these countries. 
Describe the skewness of the data. 


XAM-STYLE QUESTION 


In a particular week, a clinic nurse treats a number of patients. The length of time, 


to the nearest minute, for each patient’s treatment is summarised in the table below. 


Time (minutes) 3-6 


7-8 


9-10 11-12 


13-15 | 16-20 


Number of patients 15 


12 


17 15 


16 | 15 


Draw a histogram to illustrate these data. 
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8. The table below shows the points scored for the first three disciplines of the top twenty 
personal best decathlon performances of all time. Construct box-and-whisker plots 
of the data for the 100 metres, the long jump and the shot put in one diagram and 
compare the points scored in the three disciplines. 

Athlete 100m Long Shot 
Ashton Eaton 1044 1120 741 
Roman Sebrle 942 1089 810 
Tomas’ Dvorak 966 1035 899 
Dan O’Brien 992 1081 894 
Daley Thompson 989 063 834 
Jiirgen Hingsen 929 000 877 
Bryan Clay 1001 908 800 
Erki Nool 952 967 784 
Uwe Freimuth 847 007 870 
‘Trey Hardee 987 017 810 
Tom Pappas 910 050 869 
Siegfried Wentz 885 932 811 
Eduard Hamiliinen 975 876 854 
Dmitri Karpov 975 012 847 
Aleksandr Apaichev 870 952 851 
Frank Busemann 952 079 704 
Dave Johnson 870 918 766 
Grigori Degtyaryov 890 915 853 
Chris Huffins 1020 000 816 
Torsten Voss 931 030 788 


Comparing distributions 


Chapter summary 


Stem-and-leaf diagrams give the shape of a distribution in the same way as a histogram with 
equal intervals does, but also keep the detail available. 


Back-to-back stem-and-leaf diagrams allow distributions to be compared citing position of 
centre and spread in context. 


Box-and-whisker plots show only five ‘points’ from the distribution, so they are good for an 
overview in comparing distributions but very short of detail. 


Ina histogram, the area of each bar is proportional to the frequency in the interval. When 
the intervals are not of equal width the height of the bar is the frequency density that must be 
calculated. 


A cumulative frequency graph plots the interval end-points as the x-coordinate and the 
cumulative frequency as the y-coordinate. These points can be joined by straight lines or 
by a smooth curve and used to estimate medians and quartiles. 


Skewness is the term used to describe the lack of symmetry in a distribution. 
© A positively skewed distribution has a long tail to the right. 

oA negatively skewed distributed has a long tail to the left. 

© Asymmetric distribution has equal length tails. 

If you are comparing distributions, 

o make the comparison in context 


© make reference to both the average values and the spread. 
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Maths in real-life 


Seeing the wood and the trees 


The world is a complex place and almost all interesting problems in 
society involve multiple factors. More often than not relationships are 
not linear and there are interactions between effects. 


Statistics is becoming more complex as technology enables: 


more data to be collected 


analysis of large quantities of data to be undertaken on a useful timescale - who wants 
an accurate weather forecast based on meteorological data which is six days old? 


new data analytical techniques which rely on the data properties and not on 
assumptions in models which are only approximations to the distribution 


new ways to visualise complex relationships and to display more variables 
simultaneously. 


‘This display shows vividly how the health and wealth of nations is strongly correlated, but 
it contains much more information as well. The colour identifies the continent and the 
size of the bubble shows the population of the country. 


HEALTH 


EALTH « 


I More than 60 per cent 

41-60 per cent 

21-40 per cent 

1) 20 per cent or less 

© Data not available or 
child marriage not 
widely practised 


Geographical Information Systems (GiS) are being used increasingly to 
display a spatial component of data where this communicates an important 
feature of the full data set. These statistics produced by UNICEF show the 
proportion of women aged 20-24 in each country who were married or 

in union before the age of 18. 


Infographics is a buzzword when talking about representing data. It 
means communicating information visually through use of graphics (not 
always graphs - but any sort of images). By definition, infographics should 
communicate the information quickly and 
clearly, for example the graphic on the right gives 
us some quick and interesting facts on languages. 


‘These infographics are an easy way to get an 
overview of the information, although, as we 
saw in the introduction to Chapter 2, we should 
be careful not to rely too heavily on summary 
statistics as this may not give us the whole 
picture. 


The development of easy-to-use, free tools has 
made infographics more readily available to the 
public. Through social media, such as Facebook 
and Twitter, infographics can now be shared and 
spread quickly among many people, worldwide. 


English 


Seeing the wood and the trees 


Probability 


Objectives 


e Evaluate probabilities in simple cases by means of enumeration of equiprobable elementary 
events (e.g. for the total score when two fair dice are thrown). 


Tests for diseases can return false positives 

(a positive result when the patient does not 
have the disease) as well as positive results for 
patients who do have the disease. Conditional 
probability allows doctors to have some 

idea of the likelihood the patient really has 
the disease when a positive results occurs. 
Misunderstanding of conditional probabilities 
has caused serious miscarriages of justice; for 
example, in the UK, Sally Clark was wrongly 
convicted of murdering her two sons who died 
from sudden death infant syndrome. 


Use addition and multiplication of probabilities, as appropriate, in simple cases. 


Understand the meaning of exclusive and independent events, and calculate and use 
conditional probabilities in simple cases, e.g. situations that can be represented by means of a 


tree diagram. 


Before you start 
You should know how to: 


1. Work with fractions, e.g. calculate 


a) 241 b) 1Lx1 
ig 3 6 2 

a) 241-24-H 5) 1xiat 
ms @ 6 2 12 


2. Identify the basic outcomes of a simple 
experiment, e.g. 


How many different pairs of letters can be 
made from the word DICE? 


DI, DC, DE, IC, IE, CE 


Skills check: 


1. Calculate 


12 1 
142 b) 1 
) 45 4 


2. List the possible outcomes when a coin is 
tossed twice. 


4.1 Basic concepts and language of probability 


A probability experiment has outcomes which occur unpredictably. 


Imagine an experiment where you roll a die 500 times, and you see 74 ‘fives. 
The relative frequency or experimental probability of throwing a five in the 


’ a 
experiment is S00 > 0.148. 

However, provided the die is fair, the theoretical probability 
of throwing a five is 2 = 0.166.... So, if the die is fair, ‘on average’ 


you would see 2 of 500 = 83 fives. 


It might be argued that the die used in the experiment is slightly biased 
against rolling a five, but the evidence for this is not very strong. When the 
experiment is recreated, the number of fives varies considerably. 


To be more confident, you should roll the die more times: 7416 fives in 
50000 rolls would be stronger evidence that the die was not fair than 74 fives 
in 500 rolls, even though the proportion of fives is almost the same. 


The relative frequency of an event happening can be used as an estimate of the 


probability of that event happening. The estimate is more likely to be close to the 
true probability if the experiment has been carried out a large number of times. 


An event is a set of possible outcomes from an experiment. 
So for rolling a die, you could say: 

A is the event that a five is rolled. 

Bis the event that an even number is rolled. 

Cis the event that an odd number is rolled. 

A U Bis the union of events A and B. 


This means A or B or both can happen. 
Rolling 2, 4, 5, 6 are the outcomes which satisfy A U B. 


AQ Cis the intersection of events A and C. 
This means both A and C have to happen. 
Rolling 5 is the only outcome that satisfies A A C. 


A’ means the event ‘A does not happen. 
‘This is the complementary event, and P(A’) = 1 - P(A). 
Rolling 1, 2, 3, 4, 6 are the outcomes which satisfy A’. 


Note that BO C has no outcomes satisfying it — there are no numbers 
which are both even and odd. This can be written as BC = © or 
BO C= {} and is referred to as the null set or empty set. 


Probability 


Exercise 4.1 
1. A normal die is thrown. The events A, B, C and D are defined as: 
A: A factor of 4 is seen. B: A square number is seen. 
C: A prime number is seen. —_D: A multiple of 3 is seen. 
a) For each event, A, B, C and D, write down the outcomes which satisfy it. 
b) Give the probability of each event A, B, C and D. 
c) List the outcomes which satisfy A MC. 
d) Write down P(A OC). 
e) Find P(AND). 
f) Find P(A UB). 
2. A letter is chosen at random from the word CAMBRIDGE. The events 
A, B, C and D are defined as: 
A: A vowel is chosen. 
B: The letter B is chosen. 
C: A letter in the first half of the alphabet is chosen. 
D: A letter is chosen which has only one letter beside it. 
a) Describe the event A’ in words. 
b) For each event, A, B, C and D, write down the outcomes which satisfy it. 
c) Give the probability of each event A, B, C and D. 
d) List the outcomes which satisfy A 4 C. 
e) Write down P(A NC). 
f) Find P(A D). 
g) Find P(A VU B). 
3. a) Tossa coin 20 times and count the number of times it shows heads. 
b) Toss the coin another 20 times and count the number of times it shows heads. 
c) How many times would you ‘expect’ to see heads in 20 tosses? 
d) Did you get this in both sets of 20 coin tosses? 
e) Ifyou have access to a number of other people's results as well - how 
often do you see the ‘expected number’ of heads? 
4. a) Rolla die 30 times and count the number of times it shows a five. 
b) How many times would you ‘expect’ to see a five in 30 rolls? 
c) Roll the die another 20 times and count the number of times it shows a five. 
d) How many times would you ‘expect’ to see a five in 20 rolls? 


e) Ifyou have access to a number of other people's results as well — how often 
do you see the ‘expected’ number of fives: 


i) in30rolls ii) in 20 rolls? 


(2m Basic concepts and language of probability 


4.2 Two (or more) events 


There are many contexts in which you are interested in the outcome of more 
than one thing at a time — that is, of a compound event. In simple cases it may 
be possible to make a list of all the possible outcomes. In order to make it easier 
to keep track, it is common to write the outcomes as ordered pairs (or larger 
groups, if more terms are needed) inside brackets, ( ), and to put the whole list 
inside curly brackets, { }. This list is called the sample space of the experiment. 


So for tossing a coin twice the sample space will be: {(H, H); (H, T); (I, H); (1, T)} 


In a context where two things happen and you 
combine them according to some rule, it is often 
helpful to construct a table showing the possible 
outcomes. This is sometimes called a sample 
space diagram (also referred to as a 

possibility space diagram). 


For example, a situation in which two dice are 
thrown and the sum of the scores is taken is 
represented in the two-way table shown here. 


Score on second die 


»| Sum |1/|2/3/41/5 | 6 
s 1 2/[3|4 7 
&|_2 3 | 4 Mem ol|7|8 
gl 3 4 Mc l|z7/s8i9 

e| 4 om 6 | 7/8] 9 | 10 
3 5 6|7] 8] 9] 10/11 

6 7 |8_\ 9. |.0;) aa. |e: 


‘The blue cell represents the ordered pair (1,5) - as indicated by the row and 
column headings — but you can then insert the value of interest inside the cell; 
in this case you can insert the sum of the two scores, which is 6. 
Since all 36 of the cells in the table are equally likely outcomes, you can use 
the table to work out the probability of getting a sum of 5 (the instances of a 
sum of 5 are in grey cells): 
4 

P(sum = 5) = a6 
The advantage of this approach is that the (row, column) pair identifies what is 
actually seen in the experiment, while the contents of the cell show the result of 
this ordered pair of outcomes, according to the particular rule to be applied. 


Probability 


This rule can change. For example, two dice are again thrown, but this time 
the higher of the scores on the two dice is taken: 


High| 1 | 2|3| 4/5/16 
1 [if2[3[4]s5le 
2 [2]/2/3]4[5]6 
3 [3/3 /3]4[5]6 
4 [4[4[4]l4a[5]e 
5 [5/5[5][5[5]6 
6 |6|6|6\;6/6|6 


You can use this table to work out the probability of the higher score being 5: 
ghee) 22. 
P(high = 5) = a6 


A two-way table can be used to show the possible outcomes of a 
compound event such as throwing two dice. 
Exercise 4.2 


1. ‘There are three starters on a restaurant menu: vegetable pakora (V), onion 
bhaji (O) and chicken tikka (C). Jen and Kay each order a starter. 


List the sample space of possible orders. 

2. In question 1, if there is only one of each of the starters left, list the sample 
space of possible orders. 

3. A set of six cards show the numbers | to 6. Two cards are taken at random. 


Copy and complete the sample space diagram to show the sum of the 
numbers on the cards. 


sum |1[2/3]4]5]|6 
1 | 4 

2 | 

3 | 

4 | | | 
ee 
6. | 8 


Find the probability that the total score is 
a) 5 b) 4 c) 2. 
4. A coin is tossed and a die is thrown. A head scores 1 and a tail scores 2. 


a) List all the possible outcomes in the sample space. 


b) Construct a sample space diagram to show the total scores for this 
experiment. 


(Yom Two (or more) events 


7. 


Two dice are thrown. 


a) Construct a sample space diagram to show the product of the 
scores on the two dice. 

b) Find the probability that the score is 
i) 3 ii) 5 iii) 6 iv) 10. 

Two dice are thrown. 


a) Construct a sample space diagram to show the lower of the scores 
on the two dice. 


b) Find the probability that the score is 
i) 3 ii) 5 iii) 6. 
Two dice are thrown. 


a) Construct a sample space diagram to show the (unsigned) difference 
between the scores on the two dice. 


b) Find the probability that the difference is 
i) 3 ii) 5 iii) 6. 


4.3 Tree diagrams 


Consider a bag which has three red and five blue beads in it. 


Red 
If you take a bead out at random, and then take out another without fed 
replacing the first, you can represent the possible outcomes in a 
possibility tree diagram, as shown on the right. Blus 
Red 
You can put probabilities on the branches to complete the diagram: its 
Blue 


6 
7 Ret 6 
3 _ Red 
8 
2 Blue 18 
fi 56 
3 red 18 
5 i 56 
8 Blue 
2. Blue 22 
7 56 


Probability 


P(both beads the same colour) = P(both beads red) + P(both beads blue) 


Sampling with replacement 


If the first bead was returned to the bag before the second 
one was selected, the probabilities of red or blue would be 3 Red 


Ei 5 
g and at the second stage, as well as the first. 


Now, 


P(both beads the same colour) 
= P(both beads red) + P(both beads blue) 


ley 


Blue 


9. 5 25 
=242 

64 64 
= 34 

64 


If the first and second bead are returned to the bag, and a third 
bead is taken out, the tree diagram will look like this: 


27 


Red oy 


Red 
3 Blue => 


Red 


Red => 
5 
at a Blue 


Blue => 


Red => 


Red 


I 
on] oo 


mto/\ ore oto/\ote oto/\ oie wolon/ \\coles 
ae ae 
a Bl 


Blue => 
Blue 


Red => 


colon 


Blue 


Blue => 


(sam Tree diagrams 


Cory 
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Example 1 

A disease is known to affect 1 in 10000 people. It can be fatal, 
but it is treatable if it is detected early. 

A screening test for the disease shows a positive result 

for 99% of people with the disease. 


‘The test shows positive for 2% of people who do not 
have the disease. 


For a population of one million people, 


a) how many would you expect to have the disease 
and test positive 
b) how many would you expect to test positive? 


First draw a tree diagram: 


99 
have disease 
test positive 


2 

has disease 

tests negative 
1 000000 
population 

19998 
do not have disease 
test positive 


999900 
do not have disease 


979902 
do not have disease 
test negative 


So there are likely to be about 99 positive tests from people with the disease, 
and about 99 + 19998 = 20097 positive tests altogether. In this scenario, fewer 
than 1 in 200 positive tests is from a person with the disease, despite the test 


being very accurate. 


Probability 


More complex tree diagrams 


Example 2 
There are four red, three green and five blue discs in a bag. Two discs are drawn out. 
Find the probability that two discs the same colour are drawn. 
First draw a tree diagram - there are three outcomes at each stage: 
12 
Red To 
12 
Red Green a2 
& 20, 
mm EI he 
12 
Red 335 
6 
Green Green 735 
15 
Blue 333 
20 
Red 55 
Blue pee 
Green 735 
+ 20 
Ti Blue 333 
P(same colour) = 22 + 6 + 20 = 38 . 19 
1S25 a2) 1320152) /60) 


Tree diagrams are useful when you know the probabilities of each stage of 


compound events. You multiply along the branches to get the probability 
of a pathway, and the probabilities of different pathways can be added. 


Exercise 4.3 
1. A bag contains five blue and three green balls. 
A ball is chosen at random and the colour noted, then the ball is returned to the bag. 


A second ball is chosen. 
a) Find the probability that the two balls are different colours. 


b) Ifthe first ball is not returned to the bag, what is the probability 
the two balls are different colours? 


Tree diagrams 


2. Ata gym, 60% of the members are men. One third of the men use the gym at 
least once a week. Three-quarters of the women use the gym at least once a week. 


A member is chosen at random. Find the probability that 
a) it isa man who does not use the gym at least once a week 
b) it isa person who uses the gym at least once a week. 
3. Fora person living in a particular town, the probability that during a period of 


one year they will start a new job is 0.06. The probability that they will be fired 
from a job is 0.03. 


Assuming these events are independent, draw a tree diagram to represent 
this information. 


Find the probability that during one year a randomly selected person living 
in the town has 


a) neither of these events happen 
b) exactly one of these events happen 


c) both of these events happen. 


4. Acoin is tossed three times. Find the probability that 
a) it shows heads on all three tosses 
b) it shows the same on all three tosses 
¢) it does not show the same on two successive tosses. 
5. A bag contains ten counters: four white, three green and three red. Counters are 
removed one at a time at random, without replacement. Find the probability that 
a) the first counter drawn is red 
b) the first three counters drawn are all white 
c) the first three counters drawn are all different colours. 


4.4 Conditional probability 


In Example 1 you considered a screening test for a disease and found 
that, even though the test is very accurate, the fact that the disease 

is rare — there are many more people who do not suffer from it than 
people who do — means that fewer than 1 in 200 of those testing positive 
actually have the disease. 


19998 out of 20097 positive test results were from healthy people, so the 
conditional probability that somebody is healthy given that they have a positive result is 


19998 
P(healthy | positive test result) = Ty 


= 0.9951 


The conditional probability of an event A occurring given that an event B has 


already occurred can be written as P(A|B). 


Probability 


Example 3 
A medical centre encourages elderly people to have a flu vaccination each year. 
The vaccination reduces the likelihood of getting flu from 40% to 10%. 
If 45% of the elderly people visiting the centre have the vaccination, find the 
probability that an elderly person chosen at random 
a) gets flu b) had the vaccination, given that they get flu. 
Flu 0,045 
OL 
Vaccination 
0.45 
oe No Flu 0.405 
Flu 0.22 
0.55 oe 
No 
Vaccination 
0.6 
No Flu 0.33 
P(VAF) _ 0.045 
P(F) = 0.045 + 0.22 = 0.265 b = = = = 0. ise 
a) P(F) +: ) RO aay nas OO) 


The conditional probability of V given F is P(V|F) 


Exercise 4.4 
1. From a sample taken, 95% of car drivers wear seat belts, 60% of car 
drivers involved in serious accidents die if they are not wearing a seat 
belt, and 80% of those that do wear a seat belt survive. 
a) Drawa tree diagram to show this information. 
b) What is the probability that a driver in a serious accident did not wear 
a seat belt and survived? 


2. A shopkeeper buys one third of his stock of light bulbs from Company X, 
and the rest from Company Y. 


An independent report states that 3% of light bulbs from Company X 
are faulty and that 2% from Company Y are faulty. 


a) Ifthe shopkeeper chooses a bulb at random from his stock and tests it, 
what is the probability that it is faulty? 


b) Ifthe bulb is faulty, what is the probability that it came from Company Y? 
Conditional probability 


3. The homework diaries and completed homework of two students, A and B, 
are examined. 
There is a probability of 0.4 that Student A does not make a note in her diary 
of the homework set. She always does the homework if it is written in her 
diary, but never does the homework if it is not written in. 
‘There is a probability of 0.8 that Student B writes the homework given in her 
diary. When she does this, she will do the homework 90% of the time. If she 
has nothing written in her diary then she checks with a friend, who knows 
what the homework is 50% of the time; Student B always does the homework 
if she is told what it is by her friend. 
Draw tree diagrams representing this information. 
a) Find the probability that Student B does her homework on a 
particular night. 
b) Find the probability that both students do their homework on a 
particular night. 


c) Ifone piece of homework was given to a student but wasn't done, 
find the probability that it was given to Student A. 


4. Ina school there are 542 students, of whom 282 are girls. Of the 542 students, 
364 walk to school, and 153 of those are girls. Find the probability that a student 
chosen at random 


a) isa boy 
b) isa boy who does not walk to school 
c) does not walk to school, given that they are a boy 
d) isa girl, given that they walk to school. 
5. ‘There are 173 students in one year in a school. From the year, 25 students play 
hockey, and of these 7 are in the school’s hockey team. 
Find the probability that a student chosen from the year at random 
a) plays hockey 
b) plays in the school’s team, given that they play hockey. 


6. Of the employees in a large factory one sixth travel to work by bus, one third 


by train, and the rest by car. Those travelling by bus have a probability of ri of 


being late, those by train will be late with probability = and those by car will 
be late with probability Te 


Draw and complete a tree diagram to show this information. Calculate the 
probability that an employee chosen at random will be late. 


Probability 


7. An insurance company separates car drivers into three categories: Category X is 
‘low risk, and this category represents 20% of drivers who are insured with the 
company; Y is ‘moderate risk’ and represents 70% of drivers insured; Z is ‘high risk. 
The probability that a Category X driver has one or more accidents in a 12-month 
period is 2%, and the corresponding probabilities for drivers in Categories Y and Z 
are 5% and 9%, respectively. 

a) Find the probability that a driver insured with the company, chosen 
at random, is assessed as a Category Y risk and has one or more 
accidents in a 12-month period. 

b) Find the probability that a driver insured with the company, chosen 
at random, has one or more accidents in a 12-month period. 

c) Ifa customer has an accident in a 12-month period, what is the 
probability that they are a Category Y driver? 


8. Two identical bags each contain 12 discs, which are identical except for 
colour. Bag A contains 6 red and 6 blue discs. Bag B contains 8 red and 4 blue discs. 


a) A bag is selected at random and a disc is selected from it. Draw a tree 
diagram illustrating this situation and calculate the probability that 
the disc drawn will be red. 


b) The disc selected is returned to the same bag, along with another 
two the same colour, and another disc is chosen from that bag. 
Find the probability that 


i) itis the same colour as the first disc drawn 
ii) bag A was used, given that two discs the same colour have been chosen. 


4.5 Relationships between events 


Events are often connected to each other by some type 
of relationship. The following are the main types. 


Independence 


Two events, A and B, are independent if the outcome of A does not 
affect the outcome of B, and vice versa. 


P(A | B) = P(A) & A and Bare independent, and 
P(B| A) = P(B) & Band A are independent. 


Together with the conditional probability definition given on page 72, this 
gives the multiplication law for independent events: 


If A and B are independent, P(A 7 B) = P(A) x P(B). 


ye@e Relationships between events 


Example 4 


A set of 40 cards shows a number, from 1 to 10, of one of four 
geometrical symbols. 


Circles and squares are shown in grey, rectangles and ellipses are blue. 


‘The pack of cards is shuffled and the top card is turned over. 
Let C be the event ‘the card shows a circle’ F be the event ‘the card 
shows 5 symbols’ and G be the event ‘the card is grey’ 


a) Show that C and F are independent events. 


b) Show that C and G are not independent events. 


a) P(C)= En (there are 10 circle cards) 


P(F) = a (there are 4 cards showing 5 symbols) 


P(COF)= a (there is only one card with 5 circles) 
1 


P(C| F) = RCOP _ 40 1 


Since P(C | F) = P(C), Cand F are independent events. 


b) P(G|C) =1, since if you know the card has circles then you know 
it is grey 
P(G)=0.5 
So P(G| C) # P(G), and the events G and C are not independent. 


Addition law for probabilities 

If we take all the outcomes that satisfy A, and then all the outcomes that 

satisfy B, then any outcomes which satisfy both will be double counted, so 
P(A U B) = P(A) + P(B) - P(A B) 

That is, the probability of A or B can be found by adding the probabilities 

of A and B and then subtracting what has been double counted. 


If A is the event a fair die shows a factor of 6 and B is the event that the 
fair die shows a square number, then 1, 2, 3 and 6 satisfy A, while 1 
and 4 satisfy B. A and B is satisfied by 1, and A or B is satisfied by 

1, 2, 3,4 and 6. 


P(A UB) =25 P(A) =25 P(B) = 2 P(AIB) =<. 


ale 


Bp 4 2 
ral eee ee 
6 6 6 


Probability 


The addition law for probabilities is: 
P(A UB) = P(A) + P(B) - P(A B) 


Mutually exclusive 


Two events A and B are mutually exclusive if they cannot occur at the 


same time. A and B are disjoint events if they are mutually exclusive. 
P(A UB) = P(A) + P(B) 


Exhaustive 


A set of events is exhaustive if they cover all possible outcomes. 


Example 5 


Two fair dice are thrown. From the events listed, give two which, when taken together, are 


a) mutually exclusive 

b) exhaustive 

c) not independent. 

A: The two dice show the same number. 

B: The sum of the two scores is at least 5. 

C: At least one of the two numbers is a 5 or a 6. 
D: The sum of the two scores is odd. 

E: The largest number shown is a 6. 


F: The sum of the two scores is less than 8. 


a) Aand D are mutually exclusive: if the two dice show the same number the sum 
has to be an even number. 


b) Band Fare exhaustive: the only outcomes not in B are that the sum is 2, 3 or 4 
and these are all in F. 


c) Aand Dare not independent (because they are mutually exclusive). 
Cand E are not independent, since P(C|E) = 1 (if E happens, then 


you know C must happen). 


Relationships between events 


Here are some other terms that you need to be aware of: 


Partition: a group of sets which are exhaustive and mutually exclusive form 
a partition. The whole outcome space has been split into disjoint events, 

so their probabilities total 1, and there is no overlap between any pair. 
Compound events can be evaluated simply by going through the group 

and seeing whether each set is to be included. 


Complementary events are a special case of a partition; they are a two-event 
partition. If A and B are complementary then P(B) = 1 - P(A). The simplest 
way of representing a complementary pair is as ‘A and “not A”. 


Example 6 
Over the course of a season, a hockey team play 40 matches, 
in different conditions, with the following results. 


Weather 
Good | Poor 
Win 13 
Draw 5 
Loss 
Total 25 15 


Total 


For a match chosen at random from the season: 
Gis the event ‘Good weather’ 
W is the event “Team wins’ 
Dis the event “Team draws’ 
Lis the event “Team loses. 
a) Find the probability in each case. 
i) P(G) ii) GAD) _ iii) P(DIG) 
b) Are the events D and G independent? 


a) i) During the season, 25 matches are played in good weather, so P(G) = = = 2 


ii) GO Disa draw played in good weather, and there are 5 of those, so P(G M D) = 
P(GAD) 
PG) 


iii) P(D|G) = 


are independent. 


Probability 


Exercise 4.5 
1. Aand Bare independent events. P(A) = 0.7, P(B) = 0.4 
Find: 
a) P(AQB) b) P(AUB) c) P(A’). 
2. P(A) =0.7, P(B) = 0.4, P(A UB) = 0.82 
Show that A and B are independent. 
3. P(A) = 0.5, P(B| A) = 0.6, P(B’) = 0.7 
Show that A’ and B are mutually exclusive. 
4. Xand Yare independent events with P(X) = 0.4 and P(Y) = 0.5. 
a) Write down P(X|Y). b) Write down P(Y|X). c) Calculate P(X’ AY). 


5. The results of a survey of colours and types of cars are shown in the table. 


Saloon | Hatchback 


Silver 65 59 
Black 27 22 
Other 16 19 


One car is selected from the group at random. 
a) Find the probability that the selected car is 
i) asilver hatchback 
ii) a hatchback 
iii) a hatchback, given that it is silver. 


b) Show that the type of car is not independent of its colour. 


6. Consider the following possible outcomes of rolling a blue die and a white die: 
A: The total is 2. 
B: The white die shows a multiple of 2. 
C: The total is less than 10. 
D: The white die shows a multiple of 3. 
E: The total is greater than 7. 
F: The total is greater than 9. 


Which of the following pairs of events are exhaustive? Which are mutually 


exclusive? 

a) AB b) A,D 
°) GE d) GF 
e) BD f) AE 


Relationships between events 


7. The two sides of a coin are known as ‘head’ and ‘tail Four unbiased coins 


are tossed together. Possible events are: 
A: No heads. 

B: At least one head. 

C: No tails. 

D: At least two tails. 


Say whether each statement is true or false, giving a reason for your 


answer. 
a) Aand Bare mutually exclusive. 


b) A and Bare exhaustive. 
c) Band Dare exhaustive. 


d) A’and C’ are mutually exclusive. 


Summary exercise 4 


t EXAM-STYLE QUESTION 


i. 


A bag contains eight purple balls and two 

pink balls. A ball is selected at random from 

the bag and its colour is recorded. The ball 

is not replaced. A second ball is selected at 

random and its colour is recorded. 

a) Draw a tree diagram to represent this 
information. 


Find the probability that 
b) the second ball selected is purple 


c) both balls selected are purple, 
given that the second ball selected 
is purple. 


Walker’s disease is a rare tropical 

disease, known to be present in only 0.1% 
of the population. A new screening test 
has been analysed, and there is a 98% 
probability of testing positive when the 
person tested has the disease, and 

only a 0.2% probability of testing 

positive when the person does not have 
the disease. 


A person is selected at random 
from the population and given the 
screening test. 


a) What is the probability that the person 
will test positive? 

b) What is the probability that the person 
does not have the disease, given 
that they test positive? 

c) Jane is a doctor who is unhappy 
with guidelines which say that patients 
should be told immediately if the test 
shows positive. Explain how she 
could use the answer to part 
(b) to argue that these guidelines 
are not appropriate. 


: EXAM-STYLE QUESTION 


. Two events A and B are mutually exclusive. 
P(A) = 4, P(B) = 4 

a) Find P(A |B). 

b) Find P(A UB). 


c) Are events A and B independent? 
Provide a reason for your answer. 


Probability 


EXAM-STYLE QUESTIONS 
4. The events A and B are such that 


a) Drawa tree diagram to represent this 
information. 

b) Find the probability that a randomly 
selected rod is 
i) produced by machine Y and not 


excel =2 aBy)=t 
P(A) = = P(B) 5 and P(A AB) > 


a) Find: 
. me es defective 
i) P(ANMB) ii) P(A|B) ii) not defective. 
iii) P(B| A). c) Given that a randomly selected rod is 
b) State, giving a reason, whether or not not defective, find the probability 
Aand Bare that it was produced by machine Y. 
i) mutually exclusive 7. A golfer enters two tournaments. He 


ii) independent. estimates the probability that he wins the 
first tournament is 0.6, that he wins the 
second tournament is 0.4 and that he wins 
both tournaments is 0.35. 


a) Find the probability that he does not win 
either tournament. 


5. A computer-based testing system gives the 
user a hard question if they got the previous 
question correct and an easy question if they 
got the previous question wrong. The first 
question is randomly chosen to be hard or 


easy. b) Show, by calculation, that winning 


the first tournament and winning the 


= second tournament are not 
question right is = and the probability he independent events. 


The probability of Benni getting an easy 


gets a hard question right is + c) The tournaments are played in successive 
weeks. Explain why it would be 
surprising if these were independent 
events. 


a) Drawa tree diagram to represent what 
can happen for the first two questions 
Benni gets in a test. 


b) Find the probability Benni gets + EXAM-STYLE QUESTION 
his first two questions correct. 


©) Find the probability that the first : 8. ‘The events . and Bare — such 

question was hard, given that : that P(A) = z and P(B) = 5 

Benni got both of his first two : Find: 

questions correct. : a) P(ANB) 

$b) P(A’ B’) 
6. Ina factory, machines X, Y and Z are all ¢ 

producing identical metal rods. Machine o) P(A|B). 
X produces 25% of the rods, machine Y 9. A fair die has six faces, numbered 4, 4, 4, 5, 6 
produces 45% and the rest are produced by and 6. The die is rolled twice and the number 
machine Z. The production of rods from showing on the uppermost face is recorded 
machines X, Y and Z are 4%, 5% and 2% eachititne: 


defective, respectively. Find the probability that the sum of the two 


numbers recorded is at least 14. 


fii §=Summary exercise 4 


EXAM-STYLE QUESTIONS 


10. 


ll. 


12 


13. 


Events A, B and C are defined in the sample 
space S. 

Given that P(A) = 0.3, P(B) = 0.4 and 

P(A UB) = 0.65, find 

a) P(ANB) 

b) P(A|B). 

Aand C are mutually exclusive and P(C) = 0.5. 
c) Find P(AUC). 


If A and B are two events which are 
statistically independent, write down 
expressions for P(A > B) and P(A U B) 
in terms of P(A) and P(B). 

Each Friday, Anji and Katrina decide 
independently of one another whether to 
go to the cinema. On any given Friday, 
the probability of them both going to the 


a) 


b) 


cinema is = and the probability that at 
5 


least one of them goes is 


Find the possible values for the 
probability that Anji goes to the cinema 
ona particular Friday. 


. Of the students who took English in a certain 


school one year, 60% of them took History, 
30% of them took Biology, and 10% took 
both History and Biology. 
One of the students taking English is chosen 
at random. 
a) Find the probability that the student took 
neither History nor Biology. 
b) Given that the student took exactly 
one of History and Biology, find the 
probability it was History. 


Agneska has a spare ticket for a concert 
tomorrow and phones her friend Venus. 
It is equally likely that Venus will answer 
immediately or the phone will go to 


14, 


15. 


voicemail. If the phone goes to voicemail 
there is a probability of 0.7 that Venus calls 
Agneska back today. If Venus speaks to 
Agneska today she will attend the concert. 
i) Find the probability that Venus attends 
the concert. 

ii) Find the conditional probability that 
Agneska left a voicemail given that Venus 
attended the concert. Give your answer 


correct to 2 decimal places. 


Jamie tosses a biased coin and throws two 
fair dice. The probability that the coin shows 


a head is } Each of the dice has six faces, 


numbered 1, 1, 2, 3, 3 and 4. Jamie's score 

is calculated from the numbers on the faces 
that the dice land on, as follows: 

e if the coin shows a head, the two numbers 
from the dice are added together; 

e if the coin shows a tail, the two numbers 
from the dice are multiplied together. 

the probability that the coin showed a 
given that Jamie scores 6. 


Find 
head 


Data about age of males and females in a 
small rural area are shown in the table. 


Under 35 | 35 and over 


Male 345 | 380 


Female | 362 | 472 


A person from this area is chosen at random. 

Let M be the event that the person is male 

and let Y be the event that the person is 

under 35. 

i) Find P(M). 

ii) Find P(M and Y). 

iii) Are M and Y independent events? Justify 
your answer. 

iv) Given that the person chosen is under 
35, find the probability that the person is 
female. 


Probability 


81 


: 16. Each mother in a random sample of mothers : i) What is the modal age group? 


was asked how old she was when her first : ii) How many mothers were between 20 and 
child was born. The following histogram i 25 years old when their first child was 
represents the information. i born? 
45 + i iii) How many mothers were in the sample? 
; : 
4 eral 3 iv) Find the probability that a mother, 
= 3 : chosen at random from the group, was 
Hi pee | : between 20 and 25 years old when her 
= 1.5 : first child was born, give that she was not 
3 as ; older than 25 years. 
“ : : >: 
10 20 30 40 ©50 60 
Age (years) 
Chapter summary 


e ‘The relative frequency of an event happening can be used as an estimate of the probability 
of that event happening. The estimate is more likely to be close to the true probability if the 
experiment has been carried out a large number of times. 

e A two-way table can be used to show the possible outcomes of a compound event such as the 
total score when throwing two dice. 

e Tree diagrams are useful when you know the probabilities of each stage of compound events. 
You multiply along the branches to get the probability of a pathway, and the probabilities of 
different pathways can be added. 


e The conditional probability of A given B is P(A|B)= ECs) 


P(B) 

Be careful to work out the probability of both A and B happening directly and not from the 
probabilities of A and B happening individually. 

e Two events, A and B, are independent if P(A | B)=P(A), but if A and B are independent 
events then P(A B)=P(A) x P(B). That is, knowing that B has happened has given no 
information about the likelihood of A happening, and vice versa. 

e The addition law for probabilities is: 

P(A UB) = P(A) + P(B) - PAM B) 
e Two events A and B are mutually exclusive if they cannot occur at the same time: 
P(A UB) = P(A) + P(B) 


e A set of events is exhaustive if they cover all possible outcomes. 


Summary exercise 4 


Probability distributions and 


discrete random variables 


€ @ t2aii748 ERXNA 


To JIAI : : : . 
(0. JIANGUOMEN To CHONGWENMEN Simulations are now in widespread use to 


model what might be expected to happen 
if different strategies are used in dealing 
with a problem. For example, underground 
metro and subway stations around the 
world often have to undertake important 
engineering works. The use of random 
variables (both discrete and continuous) 
and their probability distributions is 
integral to developing good models of 
people's behaviour so the maintenance 
work causes as little disruption and 
inconvenience as possible. 


Objectives 


Be able to define a discrete random variable. 


Construct a probability distribution table relating to a given situation 
involving a discrete random variable X. 


Calculate E(X), the mean or expected value of X. 
Calculate Var(X), the variance of X. 


Before you start 
You should know how to: Skills check: 


1. 


Substitute into simple expressions, 1. Write down the values of 12 for 
e.g. find the value of the expression x=1,2,3and4, 7 
eh “ 1 when x= 2 
x wtl «+2 
Dye Dee 
2 (241) (242) 2 3 4 
64443 _13_41 
12 12 12 
Solve linear simultaneous equations, 2. Solve the simultaneous equations 
e.g. solve the simultaneous equations 0.6+a+b=1 
x+3y=9 (1) 3.24+2a+ 4b=4.6 
2x-y=4 (2) 


Multiply (1) by 2:2x+6y=18 (3) 
Eqn (3) — Eqn (2):7y=14 > y=2 
Substitute 2 for yin(1):x+6=9 => x=3 


5.1 Discrete random variables 


A random variable is a quantity that has its value determined by the 
outcome of a random event. 


Random variables can arise from probability experiments. For example, 
when you throw two dice, 
X = the sum of the scores 
is a random variable. 
Similarly, 
Y = product of the scores, and 
Z = larger of the scores 
are also random variables. 
Random variables can arise from real-life observation. For example: 
X = number of telephone calls arriving at a switchboard between 10 
and 10.30 am. 


When values of a variable have a probability attached, they form 
a probability distribution. 


X is a discrete random variable if X takes values x,, xX, X,, ..-, 


and P(X = x,) = p, where all p, > 0 and Ye, =1. 


Note that this means X can only take distinct values, all probabilities 
of X must be non-negative and their sum is 1. 


Example 1 


X is a random variable with probability distribution given by: 


x =| at || 6. [tS 
P(X=x) | 0.1 | 01 | 04) a | O1 


Find the value of a. 


seneeeccevcssceseecseccees sevecscceescsscesscscseessscaesssncsscsracsesssssscsssscsessssssessssssesesssscescad 


La = 


1-(0.1+ 0.1+ 0.4+ 0.1) =0.3 
Soa=0.3 


Discrete random variables 


Example 2 
If X = sum of the scores on two dice, the sample space for X will be: 


3 | 5 | 6 
| 7 


Hers 


10 
10 | 11 
n] 12 


Find the probability distribution for X. 


The probability distribution for X is: 


x [2[3 [4 


1 2 3 
36 36 36 


P(X =x) 


Exercise 5.1 


1. Find which of the following are discrete random variables, giving a reason 
for any which are not. 


a) [x tf[2fail4als 
p(x=x) | 02 | 03 | 04 | 03 | 02 

b) |x 2[-1[ 0] 1 [2 
p(x=x) | 02] 03 | 01 | 03 | 01 

D fe 5 | 7 [ 10 | 15 [ 20 
rfiflifafa 

PX=%) | 2 | 4 | 3 | ie | i6 

Y [x [if2[3 [415 
p(x=x) | 02 | 03 | 04 | 03 | -0.2 

2. A fair die is thrown. List the probability distribution for the following random variables: 


a) X=score on the die 
b) Y=2 score on the die 
c) Z=square of the score on the die 


@) ‘ if the score on the die is a factor of 6 


1 otherwise 


Probability distributions and discrete random variables 


3. Two coins are tossed. If X = number of heads seen, list the probability 
distribution for X. 


4. a) 
% 1 2 3 A 5 


P(X=x) | 0.2 | 01 | 03 | a | O1 
i) Finda. ii) Find P(X>2). 


») [x 2] -1] 0 
P(X =x) k 2k 2k k 
i) Findk. ii) Find P(X < 0). 
c) 


x 5 6 8 9 12 
P(X =x) 0.4 | 0.2 | 0.3 a 0.1 


i) Finda. ii) Find P(X > 9). 


5.2 The probability function, p(x) 


You can write the probability distribution of a discrete random variable as 
a possibility space diagram- a table of values with their associated probabilities 
a probability function - a formula p(x). 

Here is an example of a probability function: 
P(X =r) =kr FH, 2; 3,4: 

This means that the random variable X can take the values 

1, 2, 3 or 4 with probabilities k, 2k, 3k and 4k. 

You can then draw up a table showing all the probabilities explicitly: 


r 1 | 2]3 ]4 
Probability | 0.1 | 02 | 03 | 04 


A probability function will often have an unknown constant in its 
expression, which can be found by setting the total probability equal to 1. 


Example 3 


pxx=r)=(2)(2) T= lense 


a) Write down the probabilities that X = 1, X = 2 and X =3. 
b) Find the probability that X > 4. 


> Continued on the next page 


fay Discrete random variables 


5 P(X =o-()(5) 
\e)-3 
Pe(x=3)=(2)\(8) =a5 


b) P(X> 4)=1-P(X=12,3)=1-(2 


6 


Example 4 
‘The random variable X has probability function 


a for k=2,3,4,5,6,7 
P(X=k)= nop 
SS for k=8,9,10,11,12 


Find: a) P(X=3) b) P(X<5). 


a) P(X=3 


Sec eae ao 20nd 
b) P(X<5)=P(X=2,30rd)= 2+ 245-8 =2 


Exercise 5.2 


1. Which of the following could not be the probability function of a random 
variable? For those which could be, find the probability distribution. 


a) P(X=r)=kr for r=1,2,3,4,5 
b) P(K=r)=4 for r=1,2,3,4 
c) P(X = 1) = for r= 1, 2,3,4 
4) P(X=n)= for r=1,2,3,4 


2. For each of the following probability functions, list the probability distribution. 


a) P(Z=2)= 22 for z= 1,2,3,4 

b) P(Y= y)= for y= 1, 2,3,4,5 
erie: k(w-1) for w= 2,3, 4,5, 6,7 

e) PQWsni) = k(13-—w) for w =8,9,10,11,12 


d) P(H=r)=k(2r-1) forr =1,2,3,4 


Probability distributions and discrete random variables 


3. X isa discrete random variable with probability function 
P(X =x) = 7. for x = 1, 2,3, 4,5. 
A is the event X > 3 and B is the event X < 4. 
Find 
a) P(A) b) P(B) c) P(AMB) d) P(A|B) e) P(BIA). 
4, Zisa discrete random variable with probability function 
P(Z=2)=£ for z= 1,2,3,4. 
a) Show that k = 0.48. 
b) Find 
i) P(Z>1) ii) P(Z=4|Z> 1). 
5. P(Y=y)=cy for y=1, 2,3, 4. 
Find the value of c and hence find P(Y < 3). 


5.3 Expectation of a discrete random variable 


‘The mean or expected value of a probability distribution is 


defined as «=E(X) =>\x,p,. 


This means that we: 

multiply each value by its probability and 
sum over all possible values of the random 
variable. 


Example 5 

Members of a public library may borrow up to five books at any one time. 

The number of books borrowed by a member on each visit is a random variable, X, 
with the following probability distribution: 


xX 0 1 2 3 4 5 
Probability | 0.24 | 0.12 | 0.14 | 0.30 | 0.05 | 0.15 


Find the mean of X. 


E(x) 


= (0 x 0.24) + (1 x 0.12) + (2 x 0.14) + (3 x 0.30) + 


is ~~ We multiply each value by its 
(4 x 0.05) + (5 x 0.15) = 2.25 probability, and take the sum of these. 


Expectation of a discrete random variable 


Example 6 


X is a discrete random variable with probability distribution 


x | 1]|]2)])3) 4 
p | 5k | 2k | k | 2k 


Show that k = 0.1 and calculate E(X). 


ete eles Be The sum of all the probabilities will be equal to 1, 


and so we can use this to find a value for k. 


2 
0.2 


Now that we have a value for each probability, we 


E(X) = (1 x 0.5) + (2x 0.2) + (3 x 0.1) + can find the mean by using the method above. 
(4x 0.2) =2 


To find E(X*) - which will become useful in the next section - you can list the values 
of X* with the probability distribution of X. 
So, for the library books from Example 5: 


x? 0 1 4 9 16 25 
Probability | 0.24 | 0.12 | 0.14 | 0.30 | 0.05 | 0.15 


E(X2) = (0 x 0.24) + (1 x 0.12) + (4 x 0.14) + (9 x 0.30) + (16 x 0.05) + (25 x 0.15) = 7.93 


Example 7 


X isa discrete random variable with probability distribution 


x 


p | a {03} b }o2 


E(X) = 2.7. Find the values of a and b. 


a+0.3+6+0.2=1 and E(X) =a+ 0.6 + 3b + 0.8 =2.7 

je ae Sate te wo resus a simitaneous equations, 
a+3b=13 

and 2b=0.8 


b=04 
a=0.1 
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Exercise 5.3 


1. For the following probability distributions, calculate the expectation of X. 


a) 
x i |/a@ils | 4)]4 
P(X=x) | 0.2 | 0.1 | 0.2 | 03 | 02 
b) 
x 3) =i | o | | So 
P(X=x) | 0.2 | 03 | 01 | 03 | 01 
c) 
x 5 | 7 | 10 | 15 | 20 
vfafalafa 
P(X=2) | 3 4 | 8 | 16 | 16 


2. For each of the following probability functions calculate the mean. 


a) P(Z =z) = 22 for z= 1, 2, 3,4. 
b) P(Y=y)=4 for y= 1, 2,3, 4,5. 
3. a) 
x 1f2]3]4]s 


p(x=x) | 03 | 02] 01] a | o1 


i) Finda. ii) Find E(X). iii) Find EQC), 
> ie s|6|s8| 2 

px=x) | k | 2k | 2k | k 

i) Findk. ii) Find E(X). iii) Find EQ), 


4. Y is the larger score showing when two dice are thrown. Calculate E(Y). 


5. X isa probability distribution function and E(X) = 3.4. 


x 1 2 3 4 
P(X=x) | 0.1 | 02] 01 | a 
Find a and b. 


6. Xisarandom variable and E(X) = 7.4. 


x 4 | 6] 7 | 10} 1 
P(X=x)| 0.2 | a | 03 | b | 02 


Find a and b. Find the probability that X > E(X). 


Expectation of a discrete random variable 


5.4 The variance of a discrete random variable 


‘The variance of a probability distribution is defined as 
Var(X) = E[{X - E(X)}]. 


‘The alternative version (which is easier to use in practice) is 


Var(X) = E(X*) - {EOP 


An easy way to recall the alternative version is: 


Find the mean of the squares, then subtract the square of the mean. 


Example 8 

X isa random variable with probability distribution 
x[ 1 | | 3 | 4 
p| a | 02] 3a | 02 
a) Find the value of a. b) Calculate E(X) and Var(X). 


a) Since Yep =1, b) In table form: 
4a+04=1, 
soa=0.15 


E(X) = >. px = 2.7 
Var(X) = E(X?) - {E(X)} = 8.2 -2.77=0.91 


‘The standard deviation is the square root of the variance. 


So in Example 8 it is 0.91 ~ 0.954. 
We often use Greek symbols as notation for the population mean, variance and standard 
deviation: 

E(X)=u 

Var(X) =o 

Standard deviation of X = 0 


Probability distributions and discrete random variables 
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Example 9 
X is a random variable with probability distribution 


x/1i1|[alai4 
p|a|o2)| b | 02 


If E(X) = 2.6, calculate 
a) the values of aand b 
b) Var(X) and the standard deviation of X. 


a) a+b+04=1 ~The sum of all the probabilities is equal to 1. 
at+b=06 
a+ 0.4 + 3b + 0.8 = 2.6 “———_ We know that the mean is equal to the sum of 
at+3b=14 the values multiplied by the probabilities. 


Var(X) = E(X2) - {E(X)}? = 7.8 — 2.6 = 1.04 


Standard deviation = ¥1.04 ~ 1.02 «Variance is the square of the standard deviation. 


The variance of a discrete random variable 


Exercise 5.4 
1. For each probability distribution, calculate E(X) and Var(X). 


a) 
x e [6 |_7 ie. 8 
p(x=x) | 01 | 02 | 03 | 03 | o1 
ae 2-4 [ot [2 
p(x=x) | 01 | 02 | 03 | 03 | o1 
c) 
x 1{2/3]4]s5 
rfafafafa 
P(X=x) | > | 4 | 8 | i6 | i6 


2. For each probability function, calculate the mean and variance. 
a) P(Z=2)=SS% z= 1,2,3,4,5 


b) P(Y=y)=2 y=1,2,3,4,5,6 


k(w-1) for w = 2,3,4,5,6,7 


9 rw= w= {try for w = 8,9,10,11,12 


3. a) [x 1f/2131]415 
p(x=x) | 02] 02] 02] a | o1 


i) Finda. ii) Find E(X) and Var(X). 


[x 3[4]5]6 
P(x=x) | k | 2k | 2k | k 


i) Findk. ii) Find E(X) and Var(X). 


4. Yis the smaller score when two dice are thrown. 
Calculate E(Y) and Var(Y). 


5. Xisarandom variable and E(X) = 3.7. 


® 1 2 3) 4 5 
P(X=x) 01 | 02] 01] a 
Find a, b and Var(X). 


6. Xisarandom variable and E(X) = 5.7. 


x I 2. 4 8 16 
P(X=x) o1 | a | 03] b | on 
Find a, b and Var(X). 
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Summary exercise 5 


: EXAM-STYLE QUESTIONS 5. A discrete random variable X has the 
: 1. The random variable X has probability probability distribution shown in the 
: function table below. 
P(x=x)=£ x=1,2,3,4. x 6|7[8]9 
a) Show that k = 2. p(x=x) | 0.1 | 02 | 03 | 04 
Find Find 
b) P(X <3) a) P(X=8) b) E(X) 
fo) EX). c) Var(X) d) E(5-2X) 
} 2. The random variable X has probability e) Var(5 - 2X). 
H function 


forx=3,4,5 6. a) A regular customer in a shop observes 
P(X =x) = Kl-x) forx=6,7,8 that the number of customers, X, in 
a a shop when she enters has the following 
where k is a constant. probability distribution. 


=, 
a) Show that k a No. of : ; a 3 : 
b) Find the exact value of E(X). customers 
¢) Find Var(X). Probability | 0.1 | 0.25 | 0.3 | 0.25 | 0.1 


: 3. A discrete random variable X has the 


probability distribution shown in the table. a) Pindithe meantend standard 


deviation of X. 


iad 8 10 15 ‘The same customer also observes that 
P(X=x) 0.4 a 0.6-a the average waiting time, Y, before being 
served is as follows: 


a) Given that E(X) = 10.2, find a. 


b) Find Var(X). 
No. of olil2}3]a4 
c) Find P(X<p-o). customers 
4. An unbiased die is rolled. For each of the Average wait 
following, state whether the random (minutes) ea 


variable is a discrete uniform distribution. 


ii) Find th iting time. 
X = The number of times the die is rolled H)) eine the man walting tine 


until a six shows. b) The customer decides that, in future, if 
Y = The score showing on the top face when there are more than two customers 

the dieis rolled. waiting when she arrives, she will leave 
and return another day. On a return 
visit, she will stay no matter what the 
length of the queue is. 


Z=7 —the score 
showing on the 


top face when 
the die is rolled. 


cya Summary exercise 5 


i) What is the probability of her leaving } EXAM-STYLE QUESTION 


the shop without waiting on her first 
visit? 


ii) What is the probability that there are : 


more customers in the shop when 
she returns for the second visit, 
following an occasion on which she 
has left without waiting? 


: EXAM-STYLE QUESTIONS 
; 7. The random variable X has probability 


distribution 
x 71] 8] 9 [i] u 
p(x=x) | 02] a [03] oi] b 


a) Given that E(X) = 9.05, write down two 
equations involving a and b. 

Find 

b) the value of a and the value of b 


c) Var(X). 

. The random variable X has probability 

function 
P(X=x)=HO2k x= 1,2,3,4,5. 

25 

a) Construct a table giving the probability 
distribution of X. 

Find 

b) P(2<X<5) 

c) E(X) 

d) Var(X). 


. The random variable X has the cumulative 


probability distribution 


P(X<x) Joa 0.3|0.35 0.4|0.6|0.7/0.9} 1 


Find 

a) the probability distribution of X 
b) EX) 

c) Var(X). 


: 10. The discrete random variable X has 


probability function 
P(X = x) = kx? 

where k is a positive constant. 

a) Show that k= =. 

b) Find E(X) and show that E(X?) = 11.8. 

c) Find Var(X). 


x= 1,2, 3,4, 


11. An independent financial adviser 


recommends investments in seven traded 
options. The number from which the client 
makes a profit can be modelled by the 
discrete random variable X with probability 
function 

P(X = x) = kx 
where k is a constant. 
a) Find the value of k. 


b) Find E(X) and Var(X). 


x=0, 1, 2, 3, 4, 5, 6,7, 


‘The total investment 
is $4000 and the 
return on each 
successful option is 
$1500. 


c) Find the probability that the client makes 
a loss overall. 


d) Find the mean and variance of the profit 
that the client makes. 


: EXAM-STYLE QUESTIONS 


: 12. A fair die has the numbers 1, 2, 2, 3, 3,3 on 
x | 10 12| 15 16|18| 20] 24] 25]: 


the six faces. The die is thrown twice and X 
is the sum of the two scores. 


i) Draw up a table showing the probability 
distribution of X. 


ii) Calculate E(X) and Var(X). 


Probability distributions and discrete random variables 


iii) Find the probability that X is more than: i) Draw upa probability distribution table for 
one standard deviation away fromthe } the number of green cards Marcus chooses. 
mean. i ii) Calculate the mean number of green 


: 13. A group of coloured cards has 3 red, 3 green ; cards Marcus chooses. 


and 4 blue cards. Marcus chooses 2 of the 
cards at random without replacement. 


Chapter summary 
e A random variable is a quantity that can take any value determined by the outcome of 
a random event. 
e Xisa discrete random variable if X takes values x,, x,,x,, ..., and P(X = x,) = p, 
where all p, > 0 and > p, =1. 
e A probability function will often have an unknown constant in its expression, which can 
be found by setting the total probability equal to 1. 
‘The mean or expected value of a probability distribution is defined as u=E(X)= ey Pj. 


‘The variance of a probability distribution is defined as Var(X) = E[{X - E(X)}]. 
‘The alternative version (which is easier to use in practice) is 
Var(X) = E(X*) - {EC)? 


e The standard deviation is the square root of the variance. 
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rmutations and combinations 


Campanology is the art of bell ringing — usually 
church bells. These bells are very heavy and 
each bell requires a person to swing it. The 
inertia of the bells is so great that the ringers 
have only a limited ability to advance or slow 
down their bell in the sequence they are rung 
in. In most cases the number of bells is limited 
(six is common) so the number of available 
notes is limited, as well as the ability to change 
the sequence. The study of permutations has 
been prominent in the evolution of bell ringing 
since the seventeenth century. An ‘extent’ is a 
sequence in which all possible permutations of 
the 1 bells in the tower have been rung exactly 


once each. With six bells an extent will take around half an hour, but the time taken for an extent grows 
very quickly as the number of bells increases — with eight bells it would take nearly a full day and night. 
By the time there are 12 bells it would be around 30 years! 


Objectives 


e Understand the terms permutations and combinations, and solve simple problems involving 


selections. 


e Solve problems about arrangements of objects in a line, including those involving: 


© repetition (e.g. the number of ways of arranging the letters of the word ‘“NEEDLESS’) 


© restriction (e.g. the number of ways several people can stand in a line if two particular 
people must - or must not - stand next to each other). 


e Evaluate probabilities by calculation using permutations and combinations. 


Before you start 
You should know how to: 


1. 


List the possible outcomes in a given 
situation, e.g. 

Three coins are tossed. List the possible 
outcomes. 

HHH, HHT, HTH, HTT, THH, THT, 
‘EPHy TTT 


Skills check: 


1. Two coins are tossed. List the possible 
outcomes. 

2. Accoin is tossed and a die is thrown. 
List the possible outcomes. 


6.1 Permutations of n distinct objects in a line 


If we want to know how many ways two objects can be arranged, it is easy 
to see that there are two possible orders — in the form AB and BA. 


If we have three objects, we can argue that there are three ways to place the 
first object. We then have two objects left to place in order, and we already 
know that can be done in two ways, which gives us a total of six possible orders. 


We can list these six permutations easily, however when we have a larger 
number of orders, it becomes more time consuming. The reasoning used 
above (known as inductive reasoning) is very powerful and much 
quicker — we will use it shortly to generalise these results to provide an 
expression for the number of orders of n distinct objects. 


If we apply the same reasoning to the first example, where we only had two 
objects, we can say that we had two ways to put in the first letter of the 
sequence and then only one way to put in the remaining letter so the 
number of permutations of two distinct objects is 2x 1 =2. 


A. | Bee 
A|[C/B 
[Bl alc] 
[Bic] a | 
[cla|s | 
C|BIA 


For each of the three starting letters, the two remaining letters are first in 
alphabetical order and then in reverse order. The number of permutations 
of three distinct objects is then 3 x (2x 1) =6. 


Inductive reasoning now lets us argue that the number of permutations 
of four distinct objects in a line will be 4x 6 = 24, as we can put the first 
object in four ways, and we know the other three can then be arranged 

in six ways in each case. 


For small n, writing this out in full is simple enough, but it takes a long 
time so mathematicians have introduced a notation called factorials 
to shorten it: 


n! = n(n —1)(n — 2)... 3.2.1, where n is a positive integer. 


For reasons which will become clear shortly, it is convenient to define 0! = 1. 


The number of ways of arranging n distinct objects in a line is 1! 


Permutations of n distinct objects in a line 


You should get to know the first few factorials (1, 2, 6, 24) and your calculator 
will give you the others. However, because the numbers grow so quickly, the 
calculator will display them in standard form once it does not have enough 
room to display the number in full — for example a calculator normally shows 13! 
as 6227020800 and 14! as 8.717829 12 x 10". 


Did you know? 

The speed at which the factorials grow is why the time to play an 
extent (discussed in the introduction to this chapter) becomes so 
large with relatively modest numbers of bells. 


Example 1 


a) In how many ways can five people sit on a row of five chairs? 


b) A race has four runners. In how many orders can they finish if they all finish at different times? 
c) How many ways can the letters of the word CAMBRIDGE be arranged? 


a) 5!=120 

b) 4!=24 : It does not matter if it is people or letters to be arranged in order — 
: for all of these it is arranging n distinct ‘objects’ in order in a line. 

c) 9!=362880 


Exercise 6.1 
1. Calculate 


a) 6! b) 3! ©) 12! 
2. Calculate 

7 5! 8! 

a) a b) at c) a 


3. How many ways can the letters of the word YEAR be arranged? 


4. There are eight athletes in a race. Assuming they all finish the race and 
they all finish at different times, how many results of the race are possible? 


5. A set of eight cards each have a different colour. The cards are shuffled randomly 
and then laid out in a line. How many different arrangements are possible? 


6. How many different numbers can be made using each of the digits 1 to 5 exactly once each? 
7. How many ways can the letters of the word MATHS be arranged? 


8. In how many ways can ten people sit on a row of ten chairs? 


Permutations and combinations 


100 


6.2 Permutations of k objects from n distinct objects 
ina line 


While in some contexts the full order of objects may be of interest, often not 
all of the objects will be needed. For example in competitions there may be medals for 
the first three places and so the ordering is only important for the first few places. 


Example 2 


Ina race with eight competitors how many ways can the gold, silver and bronze medals 


Gold can be any one of 8, then silver any one of the remaining 7 


8x7x6=336 ———* 
and bronze any one of 6. 


We talk about permutations of 3 from 8 distinct items to describe the 
above, and the general formula then is: 


The number of permutations of k 


objects from n distinct objects in 
n! 


(n—k)! 


The first can be any one of 71, then the next of —1 and 
so on until the Ath can be any one of n—k+1—- which 
can be expressed neatly like this using factorials. 


a line is given by "P, = 
Calculators will have a function key combination for "P, 
or you can calculate it using the factorials (or by cancelling 


| 
COC” if k is close to 1). 


Mathematicians define 0! to be 1. 


The number of permutations of 8 objects from 8 was 8!, but using the 


8 which will only equal 8! if 0! takes 
o! 


above formula it would be *P, = 
the value 1. 


Exercise 6.2 


1. Calculate, without using a calculator 


8! 15! 120! 6! 
a) = b c d 
) 7 ) 13! ) 119! ) 3! 
1 
2. Use your calculator to work out: a) e b) P, 


3. How many different five-letter arrangements can be made using the letters in CAMBRIDGE. 


4. How many different four-digit numbers can be made using the digits 1 to 9 at most once each? 


Permutations of k objects from n distinct objects in a line 


5. Ten men are in a doctor's waiting room which has a row of eight chairs. How many 
different arrangements are there of men sitting in the chairs? 


6. There are eight athletes in a race. Assuming they all finish the race and each 
athlete finishes at a different time, how many results of the top five finishers 
are possible? 


7. A set of eight cards each have a different colour. They are shuffled randomly 
and then three of them are laid out in a line. How many different arrangements 
are possible? 


8. A candidate in an examination has to answer four questions from a choice of six, 
and can answer them in any order. How many different orders can the candidate 
provide their answers in? 


6.3 Allowing constraints on permutations (for n distinct objects) 


Often there are constraints placed on the permutations requiring something to 
happen (or not to happen). 


Example 3 
There are six people including a Herta and wife, afailis a row of six chairs. In how many ways can 


Think of the married couple as one ‘object’ — so there are five 
objects to be placed in order. But for each position the pair 
can appear in, there are two orders for the husband and wife. 


ii) 6!=720 There are 6! ways the six people can sit with no constraints. 
2x5!=240 
i Of these 6! permutations, 2 x 5! have the married couple 
720-240=480 sitting together. 


So in the rest of the cases they are not sitting together. 


You would not normally be asked to calculate both of these — but if you 
were asked part (ii) you would be advised to find the number of times 
that they are together because trying to find a way of calculating how 
often they are apart directly is very difficult. 


The difficult part of these problems is identifying how to place the 
objects or choose the objects to satisfy the given constraints. 


Permutations and combinations 


Example 4 


Eight books are to be arranged on a bookshelf. Three of the books are by the same author. In how 
many ways can the books be arranged on the bookshelf if the books by the same author are to stay 
together? 


seeeeeeeceseesereeceees Se eeeeeenenceccecccececssccessceesseesssastsessasesesesessseseees seeeeeeee sereeseeeeeed 


3! x 6! = 6 x 720 = 4320 
“ * —— First treat the 3 books by the same author as one ‘object’ — 


and arrange the 6 objects on the bookshelf. Then for each one 
of those arrangements, there are 3! = 6 orders in which the 3 
books by the same author can be arranged. 


Example 5 


How many ways can the letters of the word NEPAL be arranged so that there is not a vowel at 
either end? 


3x2x 3! 


36 ways. 


We need to put a consonant in the first place (3 ways); then a 
consonant in the last place (2 ways) and then fill the remaining 
3 spaces with the 3 remaining letters (3! ways). 


Example 6 


How many five-digit even numbers greater than 40000 can be made using the digits 1 to 5 exactly 
once each? 


Ade eeeeneeeeereneeeeessensseeeseaseseeeseeeesssseeeeeseseesesseeeaesesssssnesassssssssasessessssesssssesesend 


1x1x3!+1x2x3! 


Here we need to consider two cases separately. First we need 

—s to consider starting with 4 and with 5 separately, because 
when the number starts with 4 then it must end with 2 to be 
even, so there are 3! ways to put in 1, 3 and 5. If it starts with 
5 then there is a choice of 2 digits (2 or 4) to end, and then 
you have to put in the other 3 digits. 


=6+12=18. 


Exercise 6.3 

1. A group of six people includes two men. In how many ways can they be 
seated in a row of six chairs so that the two men sit beside one another? 

2. A group of eight people includes two married couples. In how many ways 
can they be seated on a row of eight chairs if 
a) there are no restrictions 


b) the married couples each sit beside their partner? 


item Allowing constraints on permutations (for n distinct objects) 


3. Ten books have to be arranged on a bookshelf. Three of the books are poetry. In how 
many ways can they be arranged on the bookshelf if 


a) the three poetry books have to be together 
b) the poetry books are Volumes I, II and IH of a collection and must appear in order? 


4. How many five-letter arrangements which start with a consonant can be made from the 
letters of the word MATHS? 


5. How many five-letter arrangements which end with a vowel can be made from the letters 
of the word MATHS? 


6. How many odd five-digit numbers greater than 30000 can be made from the digits 1 to 5? 


7. The five digits 1, 2, 3, 4, 5 can be arranged to give many different five-digit numbers. 
a) How many different five-digit numbers are odd? 


b) How many of them start with a 4? 


8. Izzy has 14 different CDs, of which three are film soundtracks, five are jazz and 
six are classical. How many different arrangements are there of the 14 CDs if 


a) the classical CDs are kept next to one another 
b) the CDs of each type are kept together? 


9. A singer has seven songs in the set she is to perform. Three of them are ballads 
which have to be performed all together at the end of the set. How many 
different arrangements of songs can she have? 


6.4 Permutations when some objects are not distinguishable 


If we want to arrange the letters of MATHS, all the letters are different and we know 
there are 5! ways to do this. How many ways are there to arrange the letters of 
STATS where the letters are not all different? 


This is much easier than it appears if we reason as follows: suppose the letters are 
all distinguishable (think of them as S,T|AT'S,. Then we know that there are 5! 
permutations possible. Of those, the Ts appear as T, followed by T, and also as 

T, followed by T,. So we can divide the number of orders (5!) by 2! - the number of 
orders of the two identical Ts. We can then do the same where the two S letters 


appear, and divide again by 2!, so there will be J! = 30 ways to arrange the 
letters of STATS. 


212! 


The number of permutations of n objects, of which p are identical to one 
another, q of the remainder are identical to one another, and so on 
nl 


(with p + q +... =n, and the values of p, q etc. can be 1) is given by ae 
piqtris!... 
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Example 7 

Find the number of distinct permutations of the letters of the words 
a) OXFORD 

b) MISSISSIPPI 


! 
a) £ = 360 ———————_ Here there are six letters with two the same. 


b) 2 ue =34650  —<@——_ Here there are 11 letters altogether with 4, 4, 2 and 
4l4l21 1 of |, S, P and M respectively. 


Exercise 6.4 

1. Find the number of distinct permutations of the letters of the words 
a) TRINIDAD 
b) TOBAGO. 


2. Find the number of distinct permutations of the letters of the words 
a) ASSIST 
b) HINDER. 


3. The digits 3, 3, 3, 5, 6, 6, 7, and 7 are to be used to make an eight-digit code. How many 
distinct codes are there? 


4. Another eight-digit code consists of three ‘1’s and five ‘0’s. How many distinct codes 
are there? 


5. A 16-digit code consists of two blocks of eight. The first contains two ‘1’s 
and six ‘0’s and the second contains four ‘1’s and four ‘0’s. How many distinct 
codes are there? 


6. Another 16-digit code contains six ‘1’s and ten ‘0’s. How many distinct 
codes are there? 


7. There are ten spaces to park bicycles at a school. There are six bicycles on a particular day. 
How many ways are there to put these six bicycles into the ten spaces if 
a) there are no restrictions 


b) the four empty spaces are to be beside one another? 


17 Permutations when some objects are not distinguishable 


6.5 Combinations 


In many cases the order of selection matters (as the previous sections have dealt with), 
but in others the order of selection does not matter. When the order of selection 

does not matter we refer to the number of combinations — or the number of ways 

to choose k objects out of n. 


k}~ e(n—! 


n ! 
The number of ways to choose k from n distinct objects is "C, -{ } i 


You can see why this will be the case by considering the permutations of 
nl 


k from n distinct objects, which was "P, = but now each group 


n! 
of k objects chosen will appear in k! different orders, giving (n= 
k! k\(n-k)! 


n—k)” 


as the number of choices of k objects out of n. 


Example 8 


A committee of five people is to be chosen from eight volunteers. How many different committees 
are possible? 


—_——————_ Choosing five out of eight with no restrictions is always (3) ways. 
5 


A committee of five people is to be chosen from seven men and six women. How many different 
committees are possible which have three men and two women? 


(() =35x15=525 <@——. We have to choose three out of seven men and two out of six 
‘women. 


Permutations and combinations 


Example 10 


A committee of five people is to be chosen from seven men and six women. How many different 
committees will have a majority of men? 


vi 6 7 6 7 6 
[ -{ ef }e( }e( } } As with Example 9, we need to consider different committee 

3 2 4 1 5 9 —t—_ make-ups separately. 3, 4 or 5 men will give a majority. Each of 
= (35 ye 15) i: (35 x6) i) (21 S 1) those cases is calculated in a similar fashion to Example 9. 


=525+210+21=756 


Example 11 


A group of six women and four men are to a listen to a presentation. Seats are arranged in two 
rows of five. In how many ways can they be seated so there i isa majority of women in each row? 


If you choose 3 women and 2 men to sit in the mont row ed there will also be 3 women and 2 
men in the second row. Then for each of these choices of people in the rows, the 5 people in the 
row can be arranged in 5!=120 ways (for each row). 


rrerrrrrrrerrrrrrirtrtrerrrrtit | 


Choosing 3 women (from 6) and 2 men (from 4) can be done in °C, x‘C,= 206 = 120 ways, so 
altogether there are 120120 x 120 ways or 1728000 possible ways ine group can be seated. 


Example 12 


A committee of five people is to be chosen from eight volunteers, including a husband and wife. 
How many different committees are possible which do not include both husband and wife? 


PTI eee | 


. Ne 56 — 20 = 36 8 
5 Bi = — We know choosing five out of eight with no restrictions is (:} 


and if both are included then there are six others from which to 
choose the remaining three committee members. 


Example 13 
A committee of five is to be chosen from eight volunteers, including a husband and wife. If the 


committee members are chosen randomly, what is the probability that the committee includes 
both husband and wife? 


20 5 <¢———_ In Example 12 we saw that there were 56 possible committees 


altogether, of which 20 included both husband and wife. 


56 «14 


Combinations 


Exercise 6.5 


1. 


5. 


In a class of 30 students, four are chosen randomly to attend a 
presentation by a guest speaker. How many different selections 
are possible? 


Five cards are to be selected at random from a set of 20 different cards. 
In how many ways can this be done? 


a) Show (algebraically) that choosing r objects out of n distinct objects 
gives the same number of selections as choosing (n—r) objects 
out of n. 


b) Explain why (in practical terms) there must be the same number of 
possible selections in these two cases. 


Four letters are to be selected at random from the English alphabet. 
How many possible selections are there if 


a) there are no restrictions 
b) there has to be at least one vowel? 


Drug testing authorities arrive at a cycling race to test 20% of the riders. 
‘There are 25 riders, and those to be tested are selected randomly. 

How many possible combinations of riders are there who 

could be tested? 


A new strategy is to be adopted by the drug testing authorities so they 
always test the winner and runner up in a race and then select the rest of 
the 20% sample randomly. In the race with 25 riders, how many possible 
combinations of riders will be tested? Justify why the new strategy 
might have been suggested. 


A final observation: a lot of contexts involve a mixture of permutations and 
combinations. The key is to have a clear idea of how the process is to be carried 
out, including whether the order is important in any part of the process. It is 
also worth noting that there is often more than one strategy for identifying 

all the possibilities — but if they are correct then they will give the same 
number of possibilities at the end. 
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Example 14 


Find the number of ways of arranging five women and three men in a row so there are not two 


If you arrange the five women in order (5! ways to do this) and leave a space between each pair of 
women, with spaces at each end, there are then 6 spaces in which a man can be put and satisfy the 
required condition. 
Strategy 1: 

6 5 
Choose 3 out of 6 spaces () = 20 ways and then arrange the 3 men in those 3 spaces 


(3! ways to do this). This gives a total of 5! x 20 x6 ways to arrange the group with no men standing 
together. 
Strategy 2: 


Put the first man in one of the six spaces, then the second man in one of the remaining 5 spaces 
and the last man can go in any one of 4 spaces, which also gives 5! x 6 x5 x 4 ways. 


6.6 Evaluate probabilities by calculation using permutations 
or combinations 


In this chapter so far we have concentrated on calculating the number of ways 
some event can happen. You will often be asked to use these principles to calculate 
a probability of seeing a certain event, where it can be calculated as the ratio of the 
number of outcomes that satisfy that event to the total number of outcomes. 


Example 15 
Five men and three women are arranged randomly in a row. What is the 
probability that there are not two women standing together anywhere 


In Example 14 the number of ways this restricted condition could occur was 5! x 20x 6. 


Putting 8 people into a row with no restrictions can be done in 8! ways so the required probability 


is the ratio of these two counts: 


! 
P(not two women standing together anywhere in the row) = Sait = = 


Exercise 6.6 


1. Three letters are selected at random from the word STUDY. What is the probability 
that the selection does not contain any vowels? 


2. Four letters are selected at random from the letters in the word CAMBRIDGE. 
What is the probability that the selection does not contain any vowels? 


its Evaluate probabilities by calculation using permutations or combinations 


3. A team of 5 is to be selected from a group of 11 people in which there are 2 girls, 
4 boys and 5 adults. If the team members are selected randomly find the probability that 
a) both girls are in the team 
b) there are no adults in the team 


c) there are at least two adults in the team. 


4. Acclass has 14 boys and 16 girls and 4 students are selected at random. 
What is the probability that the selection has exactly two boys in it? 


5. A standard pack of playing cards is well shuffled and dealt out equally to four players. 
What is the probability that a particular player has no diamonds? 


6. A group of eight people is made up of four husband and wife pairs. If they stand 
randomly in a line what is the probability that each husband will be standing 
next to his own wife? 


Summary exercise 6 


1. A soccer team is taking 16 people to a match 
in four cars. The owners of the cars drive 
their own cars, and each takes three other 
players as passengers. In how many ways can 
this be done? 


5. A group of 7 women and 3 men are toa 
listen to a presentation. Seats are arranged in 
two rows of five. 

In how many ways can they be seated so 
there is a majority of women in each row? 


2. A group of ten people consists of five 6. ‘The word CAMBRIDGE includes the 
married couples. There are arranged in a line, three vowels A, E and I and six different 
and randomly allocated a position. Find the consonants. 
probability that each wife is standing beside i) Find the number of different 
her husband. arrangements using all nine letters. 

EXAM-STYLE QUESTIONS ii) How many of these arrangements start 

and end with a consonant, and do not 

3. A three-digit number is made by choosing have the three vowels together. 
three different digits from 2, 3, 5, 7 and 8. 

7. Acar park at a convenience store has 14 


What is the probability that the number 


chosen will be divisible by 32 parking spaces in a row. There are 9 cars 


parked. 


4. Six cards numbered 2, 4, 6, 7,9, 11 are 
arranged randomly in a line. What is 
the probability that a card differs from a 
neighbour by only one? 


i) How many different arrangements are 
there for the 9 parked cars and 5 empty 
spaces? 

ii) How many different arrangements are 
there if the 5 empty spaces are next to 
each other? 
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iii) If the parking is at random, find the 
probability that there will not be 5 empty 
spaces next to each other. 


ii) How many different arrangements are 
there if the 5 lawyers all stand together 
and the 5 partners all stand together? 

iv) Do you think it would be reasonable to 


9. A group of 10 people consists of 2 boys, 
assume that the parking is random? 


3 girls and 5 adults. In how many ways can 


Five lawyers together with their respective a team of 4 be chosen if: 


partners all meet up for a meal. They arrange 
for a photograph to be taken of all ten of 
them standing in a line. 


i) both boys are in the team 


ii) the team consists of all adults or no 
adults 

i) How many different arrangements are iii) at least two girls are in the team? 

there if each lawyer is standing next to 


his or her partner? 


io) 


etme Evaluate probabilities by calculation using permutations or combinations 


Chapter summary 


e ‘The number of ways of arranging n distinct objects in a line is n! = n(n — 1)(n — 2)...3.2.1, 
where #1 is a positive integer. n! is read as n factorial. 


e Ol=1. 
e The number of permutations of k objects from n distinct objects in a line is given by 

np —__n! 

Da Gepl 

e ‘The number of permutations of n objects, of which p are identical to one another, q of the 

remainder are identical to one another, and so on (with p + q +... =n, and the values of 

Ep sian nt} 

P, q etc. can be 1) is given by ARSE 

e ‘The number of ways to choose k from n distinct objects is "C, = (?) = eae 


e Where there are constraints on the order or composition of a group, you need to identify 
how to place the objects or choose the objects to satisfy the given constraints. 


Permutations and combinations 


Maths in real-life 


Sporting statistics 


Scoring systems are used in all sports. In multi-disciplinary events, 
such as the decathlon and the heptathlon, these systems are 
particularly complex and make use of mathematical formulae. 


Both the decathlon and heptathlon have to 
transform performance in time for running 
different events and distances in throwing or 
jumping events into a points score for each 
component event. The winner is the person 
scoring the highest total number of points in the 
events. 


Ashton Eaton set a world record in the 
decathlon in June 2012, scoring 9039 points in 
ten events over two days. 


Jessica Ennis ran the fastest ever 100 metres 
hurdles in the 2012 Olympics on her way to 
winning the gold medal with a score of 6995 
points from the seven events. 


Some heptathletes and decathletes are stronger 
in sprints, some in distance running and others 
in throwing events, and the scoring system has 
to try to reward performances in all disciplines 
on a reasonable basis. 


In running, good performances are low times 
while in throwing and jumping the good 
performances are larger distances or heights, 

so a different approach is needed for these two 
groups of events and different formulae are 
required for each. 


This table shows the points earned in each discipline by the 
five best decathletes of all time (up to July 2014). You can see 
that there is considerable variation between athletes in 

a discipline and there is also variation between each discipline. 


beam Maths in real-life 


Athlete 100m Long Shot High 400m 110h Discus Pole Javelin 1500m_ Total 


Ashton Eaton 1044 1120 741 850 973. 1014 «6722 = 1004721 850 
Roman Sebrie 942 1089 810 915 919 985 827 849 892 798 
Tomas Dvorak 966 1035 899 840 905 1010 836 880 925 698 
Dan O’Brien 992 1081 894 868 885 977 840,07 667 


Daley Thompson 989 1063 834 831 960 932. 799 910 817 712 


This graph shows the average score in each of the ten decathlon events 
for the top 75 personal best scores (i.e. these scores are from 75 different 
athletes). You can see that there is a considerable amount of variation 
across events. 


Decathlon average points per event 
1000 5 


900 + 


800 + 
700 4 
600 4 
500 4 
400 + 
300 + 
200 4 
100 4 

oF 


Points scored 


100m Long Shot High 400m 110h Discus Javelin 1500m 


Event 


Is this fair? 


The formulae used to calculate points for each discipline have remained 

the same since 1984. In that time there have been enormous increases in 
sports and medical sciences as well as in coaching regimes, and the advent 
of professional athletes. Some events are very technical, while others rely 

on differing combinations of strength, speed and stamina. The widely 
differing average scores taken over the 75 best ever decathletes suggests 

that performances in some events are rewarded differently, but changing 

the scoring system is not an easy task when these are the yardsticks against 
which all decathletes train. A similar story holds for the women’s heptathlon. 


9039 
9026 
8994 
8891 
8847 
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The binomial distribution can be used to 
analyse any situation where outcomes of 
interest can be classified into two categories — 
either something happens, or it doesn't. 

The observed number of times something 
has happened can then be compared with 

a model of what is expected to happen 
under normal circumstances. If what has 
been seen is unusual, for example a doctor 
has an unusually high number of patients 
dying or having to be transferred to hospital, 
then further investigation can take place. 
The explanation may be that the doctor is 
incompetent, or it may even lie in a doctor's 
excellence. 


Objectives 


e Use formulae for probabilities for the binomial distribution. 


e Recognise practical situations where the binomial distribution is a suitable model. 
e Use the notation X ~ B(n, p). 
e@ Use formulae for the mean and variance of the binomial distribution. 


Before you start 
You should know how to: Skills check: 


1. 


Use your calculator to work out factorials 1. Find the value of 
n 15 

and values ol } e.g. a) 10! b) } 
r 

find the value of: 


8 
a) 6! b) (3): 


a) 61=6x5x4x3x2x1=720 
b) 8 _ 8! _ (8x7x6x5x4x3x2x1) =5 
3151 (3x2x1)(5x4x3x2x1) 
Substitute values into simple formulae, 2. Find the value of M = kx(3 - x) when k = 10 
e.g. find the value of V = np(1 - p) when and x = 0.4. 
n= 20 and p = 0.3. 
V=20 x 0.3(1 - 0.3) = 4.2. 


7.1 Introducing the binomial distribution 


You can use tree diagrams to work out probabilities of events involving a 
number of separate stages. A special case is where each stage has only two 
outcomes of interest, and the probabilities of these two outcomes are the 
same at each stage (p and 1 — p). 


Even when there are more than two basic outcomes, they can usually 

be grouped into two categories: those which satisfy the event of 

interest (often called ‘success’), and those which do not (‘failure’). 

At any stage, P(failure) = 1 — P(success). 

For example, if the aim was to count how often a factor of 6 is thrown when 
a die is rolled, then throws of any of 1, 2, 3 and 6 will result in ‘success. 


So, in this case: P(success) = 2 and P(failure) = Z 
Some patterns begin to emerge when these probabilities are shown on the 
branches of a tree diagram and the number of successes is counted: 


Stage 1 Stage 2 Stage 3 
2 34 O28. 
z S x=3,P es 
2) s < 2.2.1 
= 1 - a es 
3 ZF reaper gears 
2 Be dl 2, 
s = mp porch oi 
2 : BS xn 2PHr Ses xs 
3 3 i 
ae | 
1 F xe1PrSxons 
. 2° 3° 3 
2. 8 2 
3 2 
s 
2 
+ 3 < -1p-in2.H 
a 1 x rs ages: 
F 3: 
2 i 2 
1 SS xetprax ox S 
1 ieee 
oy E x= 0,P=3 3 3 


Consider the paths with x = 2: there are three of them, and they all have Z, ; 
and : multiplied together, but in different orders. 
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All three paths with x = 1 have i + and 2 multiplied together, but again in 
different orders. 


There is only one path for each of x = 0 and x = 3. Respectively, they have 


wie 


is land s and 2) 2 and 2 multiplied together. 
3 3 33 3 


The number of paths corresponds to the number of different ways you 
can order the three probabilities, in each case. The 1, 3, 3, 1 sequence 
of the number of paths, for x = 0, 1, 2, 3, is one of the rows of 

Pascal’s triangle, as shown on the right. 


You can write the probabilities for the number of successes in three throws 
of the die like this: 


Por=0)=1x(4) 


Pir=1)=3x2x(1) 
P(x=2) =3x/(2) x 


Por=3)=1x(2) 


If the tree diagram is extended to six stages (that is, six throws of the die), 
there would be 64 different paths, and it would be cumbersome to work 
through the whole process like this. 


However, the coefficients in Pascal's triangle and the patterns seen in the 
diagram with three stages can be used to write down expressions for the 
probabilities of getting 0, 1, 2, 3, 4, 5 or 6 successes: 


is a4 
14641 
25 © 165 2 
16 15 20 15 6 1 


x 0 a 2 3 4 


necea| (A) (2) a) BF | | se) 


Introducing the binomial distribution 


Example 1 


If the probability that Saanvi is late home from work on any day is 0.4, what is the probability 
she is late home twice in a five-day working week? 


The relevant row in Pascal's triangle is 1, 5, 10, 10, 5, 1, where the 
numbers represent how many ways you can get 0, 1, 2, 3, 4 and 
5 out of 5. There are 10 ways of getting 2, so 


P(late twice) = 10 x 0.4? x 0.6°. 


Example 2 


Adisa tries to eat at least five pieces of fruit each day. The probability that he does is 0.6, 
independent of any other day. What is the probability that Adisa eats at least five pieces of fruit on 
more than half the days in a given week. 


‘The relevant row in Pascal's triangle is the next one: 


fie es snes e : eee | 
P(more than half the days) 


= 35 x 0.6' x 0.47 + 21 x 0.6° x 0.4? + 7 x 0.6° x 0.4 + 0.67 


When the number of stages (or trials) is large (> 6), it is hard to use Pascal's 
triangle to find the binomial coefficients. 


The number of paths giving r occurrences out of m cases equals the number 
of ways of choosing r out of n, which is 


fia col es _ atl 
c-(") ri(n—r)!” 


and can also be found in the nth row of Pascal's triangle (the row 
starting 1, n, ...). 


Most scientific calculators have a button marked "C_ 


The binomial probability distribution is defined as 


plana r=0,1,2,....n q=l—p. 
r 
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We need to have values for n, the number of trials, and for p, the probability 
of ‘success’ on any one trial, in order for this to make sense, so there is a 
family of binomial distributions with two parameters. 


The parameters of the binomial distribution are n, the number of trials, 
and p, the probability of a ‘success’ on any one trial. 


For simplicity, the distribution is often written as X ~ B(n, p) 


\| 


Example 3 
If X ~ B(12, 0.2), find the probability that X = 3. 


P(X = »-(')}oaresy = 0.236 (3 s.f.) 


Example 4 
IfX ~ BuO: 0.9), find the probability that X < 8. 


Rather ites calculate all of the probabilities that X is 0, 1, 2, ..., 8, we can calculate P(X = 9 or 10) 
as the complementary probability and subtract this from 1. 


P(X<8)=1 -|( 9 osreneosy 02st s.f) 


Example 5 


If 25 fair dice are rolled, find the probability that three 6s are seen. 
SAREE REE EERE Eee EEE EEE EEE EEE E EEE EES EEE ESSE SESE EE EED Preeeee rere rrrrr irri trite r errr | 


If X = number of 6s seen, then X ~ B (25.4) 


norey=(5 (3) = 


‘—)) You can use a spreadsheet or handheld calculator to 
work out binomial probabilities by specifying n, p and 


a 
fs 
tele] 
— the value of X. 
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Exercise 7.1 


1. 


a) ‘The first few rows of Pascal's triangle are shown here. 
Write down the next row. 


b) Work out the values of 


F 7 aay 
i) 5 ii) C,. 


c) Make sure you know which elements in Pascal's triangle in part 
(a) correspond to the calculations in part (b). 


a) Calculate 


i) 10 ii) 9 

i 4 } ii és 

iii) (3) iv) (‘} 
6 2 


b) Calculate 


i) "C, ii) "C, 

iii) °C, iv) "C,,. 
If X ~ B(6, 0.5), find the probability that 
a) X=2 b) X=5. 
If X ~ B(6, 0.3), find the probability that 
a) K=2 b) X=5. 
If X ~ B(12, 0.4), find the probability that 
a) X=3 b) X=8. 
If X ~ B(12, 0.7), find the probability that 
a) X=3 b) X=8. 


IfX ~ (10, 3} find the probability that 
a) X=0 b) X=5. 
Ifx ~ (15,2) find the probability that 
a) M=7 b) X=8. 


If a fair coin is tossed six times, calculate the probability of seeing 
a) 2heads b) 5Sheads. 


: i a ee 
146441 
150051 
16 15 20 15 6 1 
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10. Ifa fair coin is tossed ten times, calculate the probability of seeing 
a) i) Oheads ii) 1 head 
iii) 2 heads iv) 3 heads. 
b) Using your answers to part (a), calculate the probability of seeing 
i) no more than | head 
ii) at least 3 heads 


iii) more than 3 heads. 


11. If X ~ B(12, 0.15), find 
a) P(X>2) b) P(X< 10). 


12. If X~ (20,2), find 
4 
a) P(X <2) b) P(X> 19). 


13. Suki was absent when the class were told they would have a test during 
the next lesson, so she has done no revision and has to guess the answers 
to five multiple-choice questions. If there are four choices for each question, 
find the probability she gets 
a) none right 


b) three right. 


7.2 Mean and variance of the binomial distribution 


How many ‘fives’ would you expect to get, on average, if you rolled a fair 
die 120 times? 


With a fair die, the probability of getting a five on any go is 3, so ‘on average’ 
you would expect to see 7 of the rolls come up with a five; that is, on average 
you would expect 20 fives in 120 throws. 


If you did a large number of sets of 7 trials of a binomial probability 
distribution, X ~ B(n, p), then the average number of successes out of 71 trials 
would allow you to estimate the value of p. If p is the probability that any 
individual trial gives a success then the proportion of successes in the long 
term will be equal to p. 


Did you know? 
One important property of random processes is that long-term behaviour is quite 
predictable, although short-term behaviour is consistently unpredictable. The ‘law of 
large numbers’ sets out these properties, but is beyond the scope of this book. 


Mean and variance of the binomial distribution 


If X ~ B(n, p), then the mean E(X) = np, and the variance Var(X) = 0 = 


npq, where q = 1 -p. 


You must be able to use these results but are not required to prove 
them formally. 


Example 6 
If X ~ B(10, 0.2), find the mean and variance of X. 


n= 10 and p = 0.2, so q = 0.8, 
and np = 2, npq = 1.6. 


Therefore the mean is 2 and the variance is 1.6. 


Example 7 

If X ~ B(80, 0.4), find the mean and standard deviation of X. 

n= 80 and p = 0.4, sog = 

and np = 32, npq = 19.2. 

Therefore the mean is 32 and the variance is 19.2, so the standard deviation is V19.2. 


Example 8 
X is a binomial distribution with mean 8 and variance 6.4. Find P(X < 1). 


np = 8, npq = 6.4 > q= 0.8 => p= 0.2, n = 40. 
Using the X ~ B(40, 0.2) distribution we get P(X < 1) = (0.8)"° + 40(0.2)(0.8)* = 0.00146 (3 s.f.). 


Exercise 7.2 
1. If X ~ B(35, 0.5), find 
a) E(X) b) Var(X). 
2. If X ~ B(75, 0.4), find the mean and standard deviation of X. 
3. Xisa binomial random variable based on 25 trials. E(X) = 20. 
a) Find the standard deviation of X. 
b) Find P(X > yu), where u = E(X). 


The binomial distribution 


4. Xisa binomial random variable with E(X) = 30 and Var(X) = 21. 
Find the number of trials. 


5. Jakob is a dentist who finds that, on average, one in five of his patients 
does not turn up for their appointment. 

a) Using a binomial distribution, find the mean and variance of the 
number of patients who turn up for their appointments, in a clinic 
with 20 appointments. 

b) Find the probability that more than three patients do not turn up for 
their appointment in that clinic. 


6. If X ~ B(20, p) find Var(X) in terms of p. Find the value of p which gives 
the largest variance. 


7. IfX ~ B(8, 0.12) find P(X <u — 0) where = E(X), 0 = J Var(X). 


7.3, Modelling with the binomial distribution 


‘The graphs that follow show the probabilities for a number of binomial distributions. 
They show the way probabilities in the binomial family behave. It is helpful in 
modelling any situation if you can develop some feeling for what the probabilities 
for any particular binomial distribution would look like. 


n=6,p=05 n=6,p=07 


Probability 
Probability 


0123 4 8 6 o 12 3 4 5 6 


Number of successes Number of successes 


When p = 0.5, the distribution is symmetrical and peaks at three successes 
out of six trials. 


When p = 0.7, successes would be expected to come more often and the 
distribution peaks at four successes out of six trials. 
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There are about three or four of the probabilities in these cases that would be 
described as relatively high. The largest individual probabilities can be estimated 


as being around 4, 


n=10,p=05 n=10,p=0.7 
0.35 0.35 
0.30 0.30 
0.25 4 0.25 
2 
3 0.20 2 0.20 
sg 5 
& 0.15 3 0.15 
0.10 + 2 0.10 
0.05 4 0.05 
vy) vo} 
o123 4 5 6 7 8 9 10 o 123 4 5 6 7 8 9 10 
Number of successes Number of successes 


If n is increased to 10, there are a larger number of the possible outcomes 
which have relatively high probabilities, but since the total probability is 1 

it follows that the largest probabilities cannot be as high as in the previous 
cases. With p slightly larger than 0.5, the distribution will be slightly skewed, 
with a tail of smaller probabilities for low numbers of successes. 


was n=20,p=0.5 


0.30 


0.25 


0.20 


0,15 


Probability 


0.10 
0,05 


o 12 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 
Number of successes 


With a larger value of n (here 20) and p = 0.5, the graph is symmetric, with again 
about half of the possible number of successes contributing relatively high 


probabilities. The highest single probability is less than és and the total probability 


of seeing 9, 10 or 11 successes is less than 3: 
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n= 20,p=0.25 


0.35 
0.30 
0.25 
2 020 
8 
8 015 
< 
0.10 4 
0.05 4 
t) tt 
o 12 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 
Number of successes 
= 20,p=0.9 
0.35 ul p 
Oooo 
o 12 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 
Number of successes 


When the value of p is moved down or up, the peak of the distribution is 
moved left or right, respectively. When p moves further from ‘ the distribution 
peak gets higher and the distribution becomes less spread out (there is 


lower variance). 


The binomial distribution can be used for any situation where the aim is to 
count the number of times a particular outcome is observed out of a fixed 
number of cases - provided certain conditions are satisfied. 


The conditions for the binomial are: 
There has to be a fixed number of trials. 
Each trial must have the same two possible outcomes (normally 


referred to as ‘success’ and ‘failure’). 
The trials have to be independent of one another. 
‘The probability of success in each trial has to remain constant. 


Independence is a key condition. However, there are a number of situations 
where independence might be presumed to be present, but in fact is not. 


eZ am Modelling with the binomial distribution 


The following are examples in which conditions 3 and 4 
do not hold, and the binomial is not appropriate. 


such as ‘A-level subjects’ and ‘choice of meal’ would not 

be independent? 

0 The lessons being taught before lunch will affect which 
students arrive together for lunch. 


e Consider a task to be performed repeatedly, for example an t 
( 


e@ Consider choosing five students from a group arriving at the 

recreational area at lunchtime. Why is it likely that things £ » £) TY ar al 

archer shooting at a target. 

0. The binomial conditions require that no learning takes place 
during the sequence of trials. 

e Consider a situation in which the underlying trial changes 
in some way. For example, consider the assumption that 
golfers have, individually, a fixed probability of getting a 
particular score on every hole. 


This assumption ignores various features of golf. For example: 

0 Each hole will have a varying degree of difficulty. 

0. Ifone player in a match plays the hole particularly well, 
or badly, this may affect the strategy adopted by his 
opponents. 


o Independence between holes for the same player (or for the 
same hole but different players) is at best an approximation 
of reality. 


Independence does not apply if sampling without replacement is being 
undertaken. In a population of size N, the second observation comes 
from a reduced population of size N—1, the third from a population of 
size N—2, and so on. The larger the population is to start with, the 
smaller the error introduced by assuming that the probability remains 
fixed at the starting proportions. 


When N is large you can sometimes use the binomial distribution as an 
approximation, even if it isn’t exactly right. 


That is, if the conditions are not met exactly, it is still possible for the binomial 
to provide a useful model. The binomial is therefore used in a lot of real-life 
situations, where only a close approximation is required. 
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Exercise 7.3 


1. For the following random variables state whether they can be modelled 
by a binomial. If they can, give the values of the parameters n and p. 
If they cannot, explain why. 

a) Adie is thrown repeatedly until a 1 is seen. X = number of throws. 

b) A die is thrown 10 times. X = number of Is seen. 

c) A bag has 25 red and 25 blue balls in it. 5 balls are taken out. 
X = the number of red balls taken out. 

d) X=number of boys in a family of 5 children. 

e) A pair of dice are rolled 25 times. X = number of times a double 
(that is, two 1s, two 2s, and so on) is thrown. 

f) A pair of dice are rolled 25 times. X = average of the sum of the 
numbers rolled. 


2. a) If X ~ B(30, 0.1), find the probability that X is exactly 
i) 0 ii) 1 
iii) 2 iv) 3. 
b) State the mode of X. 
c) Give the mean and variance of X. 


3. For the following situations, state what assumptions are needed if a 
binomial distribution is to be used to model them, and give the 

values of 1 and p that would be used. (You are not expected to do any 

calculations.) 

a) On average, a traffic warden gives a parking ticket to 8% of the cars 
he checks. One morning he checks 40 cars. How many tickets does 
he give out that morning? 

b) A box has 48 screws in it. On average, 2% of the screws made by 
that manufacturer are faulty. How many screws from the box are 
faulty? 

c) A bag has five red, three blue and two green balls in it. Balls are 
taken out and the colour noted before the ball is returned. This is 
done 50 times. How many times was a blue ball taken out? 

d) A large drum has coloured balls in it; 50% are red, 30% are blue and 
20% are green. 50 balls are removed and the number of blue balls is 
counted. 


For each of the situations described above, evaluate how well your stated 
assumptions are likely to be met. 
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A vet thinks that the number of male puppies in litters of a given size 

will follow a binomial distribution with p = 0.5. 

a) In litters of six puppies, what would be the mean and variance of 
the number of males, if it is a binomial distribution? 

The vet records the number of males in 82 litters of six puppies. The 

results are summarised in the table. 


Number of males | 0 1 2 3 4 5 6 
Frequency 8 10 | 15 | 16 |] 14} 12 Zz 


b) Calculate the mean and variance of the number of males in litters 
of six puppies. 


c) Is the binomial a good model for the number of males in a litter of 
puppies? 
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EXAM-STYLE QUESTION a) Calculate the value of n. 

1. A quality control agent tests sets of 10 ‘The expected number of people with green 
components from a production line eyes in a second random sample is 3. 
which is known to produce 98% good 


b) Find the standard deviation of the 


number of people with green eyes in this 
a) Find the probability that a set chosen at second sample. 


random is free from defects. 


components. 


EXAM-STYLE QUESTION 

3. A recent survey suggested that the 
proportion of 15-year-old girls who never 
consider their health when deciding what to 
eat is 0.1. 


b) Ifthe quality control agent tests five 
sets before lunch, find the probability 
that four of these sets were free from 
defects. 


c) Inone week, the agent tests 70 sets. 
On average, how many sets does 
she find which are not free from 
defects? 


Assuming that this figure is accurate, what 
is the probability that in a random sample 
of thirty 15-year-old girls the number who 
never consider their health when deciding 


It is estimated that 4% of people have green sWliatitorestis 


eyes. In a random sample of size n, the 


expected number of people with green 
eyes is 5. $ b) exactly four? 


a) four or fewer 
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EXAM-STYLE QUESTION 5. The table below shows, for a particular 
population, the proportions of people in 


4. Each evening Louise sets her alarm for 
each of the four main blood groups. 


7.30 am. 

She believes that the probability that 

she wakes before her alarm rings each 
morning is 0.4, and is independent from 
day to day. 


Blood group | oO A B AB 
Proportion | 0.40 0.28 | 0.20 | 0.12 


a) A random sample of 40 people is selected 
from the population. Determine the 
probability that the sample contains 


a) Assuming that Louise's belief is correct, 
determine the probability that, during a 
week (five mornings), she wakes before 


i) at most ten people with blood 
her alarm rings 


group B 

i) on two or fewer mornings ii) exactly five people with blood 
group AB 

iii) more than ten but fewer than 20 
people with blood group O. 


ii) on more than one but fewer than 
four mornings. 


b) Assuming that Louise's belief is 
correct, calculate the probability that, 
during a four-week period, she wakes 
before her alarm rings on exactly seven 
mornings. 


A random sample of 750 people is selected 
from the population. 


b) Find the values for the mean and 
variance of the number of people in the 
c) Assuming that Louise's belief is correct, sample with blood group A. 
calculate values for the mean and 
standard deviation of the number of 
mornings in a week when Louise wakes 


before her alarm rings. 


6. Ronnie and Parveen regularly play each 
other at tennis. 
The probability that Ronnie wins any game 
is 0.3, and the outcome of each game is 
independent of the outcome of every other 
game. 


a) Find the probability that, in a match of 
15 games, Ronnie wins 


d) During a 50-week period, Louise 
records, each week, the number of 
mornings on which she wakes before her 
alarm rings. The results are as follows. 


Number of 
mornings 
Frequency 10 | 1 So | 9 16 |S 


i) exactly four games 


o;1;2)3)4]5 


ii) fewer than half of the games 


iii) exactly half of the games 


iv) more than one but fewer than 


i) Calculate the mean and standard i 
six games. 


deviation of these data. P . 7 , 
Ronnie attends tennis coaching sessions for 


three months. 

He then claims that the probability of 

him winning any game is 0.6, and that the 
outcome of each game is independent of the 
outcome of every other game. 


ii) State, giving reasons, whether your 
answers to part (d)(i) support Louise's 
belief that the probability that she 
wakes before her alarm rings each 
morning is 0.4, and is independent 
from morning to morning. 
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b) i) Assuming this claim to be true, 
calculate the mean and standard 
deviation for the number of games 
won by Ronnie in a match of 15 games. 


ii) To assess Ronnie’s claim, Parveen 
keeps a record of the number of 
games won by Ronnie ina series of 
10 matches, each of 15 games, with 
these results: 

9117109 1287 8 10 


Calculate the mean and standard 
deviation of these values. 

Hence comment on the validity of 
Ronnie's claim. 


7. Plastic clothes pegs are made in several 


colours. 


The number of blue pegs may be modelled 
by a binomial distribution with parameter 
Pp equal to 0.2. 
‘The contents of packets of 40 pegs of several 
colours may be considered to be random 
samples. 
a) Determine the probability that a packet 
contains 
i) at most 10 blue pegs 
ii) exactly 10 blue pegs 
iii) more than 5 but fewer than 15 blue 
pegs. 
b) Conn, a statistics student, claims to have 
counted the number of blue pegs in each 
of 100 packets of 40 pegs as part of a 
homework assignment. From his results 
these values are calculated: 
Mean number of blue pegs per 
packet = 8.5 
Variance of number of blue pegs per 
packet = 18.32 


Comment on the validity of Conn’s claim. 


EXAM-STYLE QUESTION 


8. 


Copies of an advertisement for a course in 
practical statistics are sent to mathematics 
teachers in a particular country. For each 
teacher who receives a copy, the probability 
of subsequently attending the course 

is 0.07. 


18 teachers receive a copy of the 
advertisement. 

What is the probability that the 
number who subsequently attend the 
course will be 

a) two or fewer 

b) exactly four? 


a) State two assumptions of the binomial 
distribution. 
b) A small garden decoration contains four 
electric light bulbs. 
The probability that a bulb is faulty is 0.09. 
i) What is the probability that exactly 
two bulbs are faulty? 
ii) What is the probability that at least 
one bulb is faulty? 
c) A set of outdoor lights consists of 20 light 
bulbs connected so that if at least 
one is faulty then none of the bulbs 
will light. 
The probability that a bulb in the set is 
faulty is p. 
Show that if p = 0.034 there is 
approximately a 50 : 50 chance that the 
set of lights does not light. 
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EXAM-STYLE QUESTIONS 
10. Ina set of coloured beads used in costume 
jewellery, 10% are purple. 
a) Find the probability that in a string of 30 
beads two or fewer beads are purple. 
b) Calculate the probability that in a 
string of 32 beads exactly two beads are 
purple. 
c) State one assumption that you have made 
in answering parts (a) and (b). 


11. Ina certain town, 72% of cars are fitted with 
satellite navigation equipment. A random 
sample of 10 cars from this town is chosen. 
Find the probability that at least 8 of these 
cars are fitted with this equipment. 


12. Ina certain skiing resort more than 15cm of 
snow falls on 30% of the days in high season. 
Menna is planning a holiday spending 7 days 
at the resort during high season. 
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13. 


i) Assuming the days on which more than 
15cm of snow falls occur at random, 
what is the probability that Menna will 
have fewer than 3 days on which more 
than 15cm of snow falls? 


ii) Comment on whether it is reasonable to 
assume randomness in this situation. 


Light bulbs are sold in boxes of 20. There 

is a constant probability of any light bulb 
being faulty, independently of any other 
bulb. The mean number of faulty light bulbs 
in a box has been found to be 1.2. Find the 
probability that a box contains less than 

2 faulty bulbs. 


Chapter summary 


The binomial probability distribution is defined as 


Pox=n=(")pa~ PSU aanih Gly 


An=ny 


where (" ="C=—2 
r 


The parameters of the binomial distribution are n, the number of trials, and p, the probability 
of a ‘success’ on any one trial. 


The binomial distribution is often written X ~ B(n, p). 

If X ~ B(n, p), then the mean E(X) = np, and the variance Var(X) = 0° = npq, where q = 1 - p. 
The conditions for the binomial are: 

1. There has to be a fixed number of trials. 

2. Each trial must have the same two possible outcomes (normally referred to as ‘success’ 
and ‘failure’). 

3. The outcomes of the trials have to be independent of one another. 

4, The probability of each of the outcomes has to remain constant. 


When sampling without replacement from a population of size N, where N is large, the 
binomial distribution is a useful model even though it is not exactly right. 
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ric distribution 


Archimedes was a Greek who lived in the third century BC and is 
widely regarded as the greatest mathematician of the ancient world. 
He is probably most famous for his work in various branches of 
mechanics, but he also had a profound influence in other areas 
such as pure mathematics where his work with geometric series is 
fundamental to the geometric distribution. 


Objectives 


e Use the formula for probabilities for the geometric distribution. 


e Recognise practical situations where the geometric distribution is a suitable model. 
e Use the notation X ~ Geo(p). 
e Use the formula for the mean of the geometric distribution. 


Before you start 


You should know how to: Skills check: 
Calculate conditional probabilities. 1. A fair die is thrown; A is the event ‘the 
1. A single-digit positive integer is selected at number is odd’ and B is the event ‘the 
random; A is the event ‘the number is odd’ number is prime. 
and B is the event ‘the number is prime. Find the probabilities 
Find the probabilities a) P(BIA) 
a) P(BIA) b) P(B| 4’). b) P(BIA’). 


a 3 
a) P(A)= g P(BA) = 3 since 3,5 and 7 


are odd primes: so P(B| A) = - = > 


4 
b) P(A’)= Py P(BOA’) = 1 since 2 is the 
| 


i 
only even prime: so P(B| A’) = z= 
> 


2. I toss two fair coins. What is the probability of | 2. I throw two fair dice. What is the probability 
seeing two heads, given that at least one head is of seeing two sixes, given that at least one six 
showing? is showing. 


P(2 heads) = —; P(at least 1 head) = =, so 


me 


P(2 heads | at least 1 head) = + = 


8.1 Introducing the geometric distribution 


Some children’s games include having to throw a 6 on your turn to be able 
to start moving on the board. It can be hugely frustrating when you have to 
wait while others are already moving. But just what is the mathematics of 
this situation? 


The likelihood of only needing one throw is 7 but what happens after that? 


If I get started on the second throw I know I did not get a 6 the first throw 
and then I did get a 6 the next throw, so the probability is : x 2 = =. 
If I get started on the third go, I had to not get a 6 twice and then get a 6, 
so the probability is (2) 4 = 

IfI start on the tenth go I must have not got a 6 on all of the first nine 


throws and then got a 6, so the probability is (2) x - Ican see a really 

strong pattern that I know will always be the case: if I start on the rth 

go I must have not got a 6 on all of the first (r = 1) throws and then got 
5 1 


a 6, and the probability will be (2) x ze 


Example 1 
I play a game in which I need to throw a 6 to start. What is the probability that 


a) Istart on my fourth turn 
b) Ihave not started after 6 turns? 


> Continued on the next page 
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a) Ihave to get something other than a 6 on the first three turns and then throw a 6, so the 
3 
=) x 4 = 1 = 0.0965 (3 sf) 
6 1296 
b) Lhave to get something other than a 6 on all of the first six throws, so the probability is 


sy 
(:) = 0.335 (3 s.f.) 


probability is e 


The reasoning that to start on a particular turn requires having 
‘failures’ (not throwing a 6) on all turns up to that one and then 
having a ‘success’ (a 6) holds for all positive integers r, and is an 
example of the geometric distribution - so called because the 
ratio of probabilities is constant, so they form a geometric series. 


The geometric distribution is defined as 


P(X=r)=q'p  7=1,2,3... q=l-p 


We only need to have the value of p, the probability of a ‘success’ 
on any one trial, for this to make sense, so there is a family of geometric 
distributions with one parameter. 


The parameter of the geometric distribution is p, the probability of a ‘success’ 
on any one trial. 


For simplicity the distribution is often written as X ~ Geo(p). 


The family of geometric distributions has some interesting features: 
e@ P(X =r) = qx P(X =r -1)- this recurrence relation is what 


characterises the geometric distribution. 


e P(X=r)>0_ forallr—so every geometric distribution has an 
infinite possibility space (the set of positive integers). 

e P(X =r)< P(X =r- 1) for all r (except in the trivial case where 
p= 0 or 1) -so the mode of every geometric distribution is 1. 

e P(x =r|X> k) = P(X =r- k), k <r - sometimes known as the 


memorylessness property of the geometric distribution — such that the 
waiting time for the event to happen does not depend on how much 
time has passed (or how many trials have already happened). 
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Example 2 


Example 4 
I toss a fair coin until I see the first head. What is the probability that I need 


a) exactly three tosses 
b) at least five tosses? 


if Xis the number aii fees until the first head 
then X ~ Geo(0.5) and I need to find P(X = 3). 


a) P(X =3)=0.5* x 0.5 = 0.125 (= +) 


b) P(X 25) =P(X > 4) = 0.5" = 0.0625 E = . 


Example 5 
If X ~ Geo(0.4) find 
b) P(X > 4) 


If X=6 then the 
= 0.6* = 0.1296 condition X > 4 
adds nothing. 
=0.6° x 0.4 = 0.031 104 
P(X =6andX >4 : 
Al and X > ye = 2504 06x04 = 024 


Graphs of two examples of the geometric distribution are shown on the next 

page using the same scales on the two axes so you can make realistic comparisons. 
You can see the geometric decrease in the probability (the heights of successive bars) 
in both cases, with the mode being 1. You can also see that when p is smaller 

the distribution extends further before probabilities become negligible 

(though never become zero for any parameter) p # 0,1. 
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Geometric distribution p = 0.8 


Geometric distribution p = 0.3 


0.9 
0.8 
0.7 + 
2 06 
8 05 
2 044 
03 
0.2 
O.4 
0 
1234567 8 9 1011 1234567 89 1011 

Exercise 8.1 

1. IfX ~ Geo(0.75) calculate 
a) P(X =2) b) P(X >3). 

2. If X ~ Geo(0.2) calculate 
a) P(X =2) b) P(X <2 

3. IfX ~ Geo(0.1) find 
a) P(X =8) b) P(X >6). 

4. IfX ~ Geo(0.6) calculate 
a) P(X =5) b) P(X 5 

5. If X ~ Geo(0.4) find 
a) P(X =4) b) P(X =6|X > 2). 

6. A fair spinner has four equal sections coloured red, green, blue and black. 
It is to be spun until it lands on green. What is the probability that it takes 
more than four spins to see green for the first time? 

7. A production line has a 3% defect rate. Items are inspected at random. 


What is the probability that no defects are found after 50 items have 


been inspected? 


Josie is the captain of a five-a-side football team playing in a league of eight 
teams. She has lost the toss in each of the first three games. What is the 
probability that she will not win the toss in the league that season? 

(Josie’s team play seven matches in the league during the season.) 


Marta is a good soccer player who has a record of scoring with 15% 
of the shots she attempts. In a particular game Marta has eight shots. 
Calculate the probability that Marta does not score in the game. 


Introducing the geometric distribution 


8.2 The mean of the geometric distribution 


Consider a series of 200 Bernoulli trials in which the probability of success 
is 0.4. We know from Chapter 7 that, on average, 80 (= 200 x 0.4) successes 
will be observed in that sequence. Ignoring the fact that any failures at the 
end of the 200 trials recorded will not be included in a block, this means that 
the average length of blocks up to a success is 2.5. 


We can generalise the reasoning to n trials with a probability of success on each 


trial of p, where the expected average length of blocks is given by x ~ = x. 


np p 
The approximation is there because of any end failures which are not part 


of a block ending in a success, but as n gets very large this becomes negligible. 


The formal proof of the mean is not required in this syllabus, though you are 
required to use the result. 


If X ~ Geo(p) then E(X) = 


Example 6 

If X ~ Geo(0.8), find 
a) E(X) 

b) P{X <E(X)}. 


Example 7 
If X ~ Geo(p), and E(X) = 4, find 
a) the value of p 

p{x > E(x)} 


the least integer n such that P(X > n) —<1l70s 


P{X > E(X) = 4} =0.75* = 0.316 
P(X > n) = 0.75"; for this to be < 0.01 (1%) 
the least n is 17. 
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Exercise 8.2 
1. IfX ~ Geo(0.4), state the value of E(X). 
If X ~Geo(0.1), state the value of E(X). 


If X ~ Geo(0.2), determine P{x < E(X)}. 


- 2 


If X is a geometric random variable and E(X) = 3, determine 
a) P(X=1) 
b) P(X > 3). 
5. IfX isa geometric random variable and E( X) = 2, determine 
a) P(X =2) 
b) P(X > 4). 
6. If X is a geometric random variable and E( X) = 5, determine 
a) P(X =4) 
b) P(2< X<5). 
7. Carmen wants to take a taxi. Where she is, 5% of the vehicles are taxis. 


X is the number of vehicles that she sees up to and including when 
she sees a taxi. 


a) On average, how many vehicles will she see up to and including 
when she sees a taxi? 


b) Calculate the probability she will see a taxi in the first five vehicles. 


c) Calculate the probability that she is still waiting to see a taxi after 
she has seen 30 vehicles. 


d) Evaluate the mode of X, i.e. what number vehicle is the most likely 
to be the first taxi she sees? 


8. Ina game using a fair die, you have to throw a 6 to start. The number 
of throws needed to obtain the first 6 is N. 


a) State the distribution of N. 
b) Find P(N = 2). 

c) Find P(N <5). 

d) FindP{N 2 E(N)}. 


iki ~The mean of the geometric distribution 


9. Ina certain country the proportion of babies that are male is 48.78%. 
Assume that in a family the gender of babies is independent. X is the 
number of babies up to and including when the first girl is born. 


a) State the distribution of X. 
b) Determine E(X). 
c) Determine P(X = 1). 


d) Determine P{X > E(X)}. 
8.3, Modelling with the geometric distribution 


The geometric distribution is based on the waiting time until the first ‘success’ 
is registered in a sequence of Bernoulli trials. In Chapter 7 you met the 
binomial distribution which counted the number of successes in a fixed 
number of Bernoulli trials, so there are strong similarities in the two contexts 
and you should look back at Section 7.3 to remind yourself of the sorts of 
contexts in which the assumptions of independence and constant probability 
may not hold. 


‘There are one or two things to add to the discussion in Chapter 7 which are 

of particular relevance to contexts used for the geometric distribution. One 
example is retaking an examination until you pass it - for example taking the 
driving test in the UK can be done as often as you want and when you pass 

you have a licence to drive. Is it reasonable to assume that the probability of 
passing remains constant? When you get the results of the test, you are given a 
detailed breakdown of the things you didn't perform adequately on, so 

while it is a pass/fail (satisfying the need for a two-outcome event) it might 

be expected that an individual would take more lessons before retaking the 

test and that the instructor would focus particularly on faults from the 
previous test. However, you have to remember that models are not perfect 
representations of the real-life situation and the question arises as to whether 
the model is useful even if it is not perfect — is it reasonably close? Examination 
questions may use contexts in which the geometric distribution is not a perfect 
fit and you need to be prepared to work with it - they may ask you to explicitly 
comment on whether you think the assumptions made were reasonable, and in 
that case you are expected to critically evaluate whether or not independence 
and/or constant probability are reasonable assumptions. 


The geometric distribution 


140 


Example 8 


Anya makes repeated attempts to thread a needle. The number of attempts 
up to and including the first success is denoted by X. 


a) State two conditions necessary for X to have a geometric distribution. 


b) Assuming that X has the distribution Geo(0.4) find the probability that 
she needs more than three attempts before she threads the needle successfully. 


c) Suggest a reason why one of the conditions you gave in part (a) might 
not be satisfied in this context. 


A eeeeeencesceceseeesceseeeneeeneeeseeeseseneneenenenensenesssnenerenscsensesesesescsseeeseeessssecssecnersced 


a) Attempts are independent of one another, and the probability 
remains constant. 


b) P(X > 3)=0.6 = 0.216 
c) If she fails to thread the needle she might do something like moving closer 


to a bright light, or put her reading glasses on so that she can see 
better - which would change the probability. 


Example 9 


The diagrams illustrate all or part of the distributions of the discrete 
random variables X, Y, Z. 


Distribution of X 


Distribution of ¥ 


0.6 + 


Probability 


04 


Modelling with the geometric distribution 


Distribution of Z 


Probability 


One of these variables has the distribution Geo(0.7). State, with a reason, which variable it is. 


It is X because it is the only one with a mode of | and strictly decreasing 
probabilities. Y is a binomial with n = 5 and p = 0.7, and Z is uniform on 
the integers 0 to 5. 


You saw two examples of the graphs of geometric distributions in Section 8.1. 
Two more are given below, using the same scales: 


Geometric distribution p = 0.5 Geometric distribution p = 0.2 


09 $$ $$ i 0.9. }$$ $i 


123 45 6 7 8 9 10 11 123 45 67 8 9 10 11 


You know what the means of these distributions are 2 for the first one 

and 5 for the second, but just visually you can also see that the variance of 

the first one is much less than the second, and that while the individual 
probabilities for the first one have become negligible long before the horizontal 
scale stops at 11, the probabilities for the second one are clearly not negligible 
and the graph of this distribution would need to be extended to include more 
of the (infinite) possibility space. 


You should also note that when p is small, every possible outcome has a small 
probability — and there are a lot of them which are very similar; as p gets 
larger, the number of possible outcomes with non-negligible probabilities is small. 
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Exercise 8.3 


1. For the following random variables state whether they can be modelled 

by a geometric distribution. If they can, give the values of the parameter p. 

If they cannot, explain why. 

a) A die is thrown repeatedly until a 1 is seen. X = number of throws. 

b) A die is thrown 10 times. X = number of Is seen. 

c) A bag has 50 red and 25 blue balls in it. Balls are taken out and not 
replaced until the first blue is taken. X = the number of balls taken out. 

d) A bag has 2n red and n blue balls in it, where n is very large. Balls are 
taken out until the first blue is taken. X = the number of balls taken out. 

e) A pair of fair dice are rolled until a double one is seen. X = number of 
times the dice are rolled. 

f) A pair of fair dice are rolled 25 times. X = average of the sum of the 
numbers rolled. 


2. For each of the following geometric distributions state the mode and 
the mean, and find the median. 


a) X ~ Geo(0.6) 
b) Y ~ Geo(0.4) 
c) X~Geo(0.2). 


3. For the following situations, state what assumptions are needed if a 
geometric distribution is to be used to model them and give the value 
of p that would be used. Explain any situations where you feel the 
assumptions are questionable. (You are not expected to do any calculations.) 
a) Onaverage, a traffic warden gives a parking ticket to 8% of the 
cars he checks. How many cars does he check before he gives the 
first ticket of the day? 


b) A box has 48 screws in it. On average, 2% of the screws made 
by that manufacturer are faulty. How many screws can be taken 
from the box before one is faulty? 

c) A bag has five red, three blue and two green balls in it. Balls are 
taken out and the colour noted before the ball is returned. This is 
done repeatedly. How many balls are taken out up to and including 
seeing the first green ball? 

d) A large drum has coloured balls in it; 50% are red, 30% are blue 
and 20% are green. Balls are taken out and the colour noted 
(the ball is not returned). This is done repeatedly. How many 
balls are taken out up to and including seeing the first green ball? 


1@eae Modelling with the geometric distribution 


1. 


A good angler thinks the number of times he has to cast his fly before 


he gets a catch can be modelled 


by a geometric distribution. He records 


the number of casts he has made up to and including a catch for 80 
catches. The results are summarised in the table. 


Number of casts | 1 | 2 


Frequen | 42 | 20 
quency 


a) Calculate the mean number of casts he has made per catch. 


b) Calculate the expected frequencies of observing 1 to 8 casts up 
to and including a catch (for 80 catches) using the parameter for 
a geometric distribution that has the same mean as his results. 


c) Doyou think the geometric distribution is a good model for this 
situation? Explain, giving a reason for your answer. 


Summary exercise 8 


If X ~ Geo(0.2) find 
a) P(X =4) 
b) P(X > 6). 


If X ~ Geo(0.9), state the value of E(X). 


If X ~ Geo( +) find 
a) P(X =3) 

b) E(X) 

c) P{X > E(X)}. 


4. IfX ~ Geo(0.4) find 
a) P(X =3) 
b) P(X > 3) 
c) P(X =6|X > 3). 


d) Describe what property of the geometric 
distribution your answers to parts (a) 
and (c) are an example of. 


5. IfX ~ Geo(0.1), find P{X < E(X)}. 
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6. If X is a geometric random variable :  b) Using the distribution you stated in 
and E(X) = 4, find part (a), calculate the probabilities 


a) P(X=1) ; i) P(X =4) ii) P(X<3). [4 
b) P(x > 3). ic) State E(X). 

7. A fair spinner has eight equal sections 5 d) Calculate the probability that the first 
numbered 1| to 8. It is to be spun until it lands : date in March on which Adeline needs 
ona prime number. What is the probability fewer than the mean number of attempts 
that it takes more than four spins to see a : to light the fire is March 6th. 3 


prime sortie firstiaine: iH1.a) IfX,,X, are both geometric random 

8. Only 2% of vehicles on a particular stretch of : variables with parameter p, show that 
road are prepared to stop to give a lift to hitch- P(X, +X,= 3) = 2p°q. 3 
hikers. Amir wants to hitch-hike on that road. : 7 
Find the probability that he has not got a lift yet ! 
after 20 vehicles have gone past him. H 


b) IfY,, Y, are both geometric random 
variables with parameter 0.3, find 


H + F 2 
Do you think the geometric distribution isa} PY, %? 3) 


good model for this situation? Explain, giving 


$12. Y ~ Geo(p). 
a reason for your answer. : 


a) Show that 


. H P(Y iseven)=qp+qpt+qpt. 
EXAM STYLE QUESTIONS : (where q = 1-p). 3 
9. Thirty per cent of the members of a very $b) Use the formula for the sum to 

large group of fitness clubs are over 40 years} infinity of a geometric progression to 
old. The company owning the clubs want to} show that P(Y is even) = Gh 3 
H s ica 


interview a random sample of members who 

are over 40. The secretary uses the database of H 13. Christina and Novak play a game where they 
all members, and selects members at random : take turns to throw a fair dice. Christina goes 
until she finds ones who are over 40. first and the game ends when either player 
throws the first 6. Calculate: 


a) P(Novak wins on his first throw) 


If X denotes the number of members selected 
up to and including the first member who is 


over 40, calculate: :  b) P(Christina wins on her third throw) 
a) P(X =4) GI ¢) P(Novak gets a third throw) 

b) P(X>4) [l: d) P(Christina throws exactly three times) 
o) P(X <6). (3): e) P(Novak wins the game). 


10. Adeline has to light a gas fire for her 
grandmother every day. It is hard to light, and 
repeated attempts have a probability 


of be of success, independent of one another. 
3) 


a) IfXis the number of attempts needed 
until the fire lights on a particular day, 
state the distribution of X. ipa] 


Summary exercise 8 


Chapter summary 


The geometric distribution is defined as 

RX=r)=qp ra123.., gal—p 

The parameter of the geometric distribution is p, the probability of a ‘success’ on any one trial. 
The geometric distribution is often written as X ~ Geo( p). 

P(X - r) = qx P(X =r- 1) — this recurrence relation is what characterises the geometric 
distribution. 

P(X = r) > 0 for all r - so every geometric distribution has an infinite possibility space (the 
set of positive integers). 

P(X =r) <P(X=r- 1) for all r (except in the trivial case where p = 0 or 1) - so the mode 
of every geometric distribution is 1. 

P(X =r|X >k) = P(X =r—k),k <r — sometimes known as the memorylessness property 
of the geometric distribution — that the waiting time for the event to happen does not depend 
on how much time has passed (or how many trials have already happened). 


‘The conditions for the geometric distribution are: 

1. Each trial must have the same two possible outcomes. 

2. The outcomes of the trials have to be independent of one another. 
3. The probability of each of the outcomes has to remain constant. 
If X ~ Geo(p) then E(X) = - 
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The normal distribution 


Although the power of modern computers 

is now so great that a lot of modelling and 
simulations are now based on sampling from 
very large data sets rather than randomly 
generating from a probability distribution 
like the normal, it is still a very powerful 

and widely used distribution because of the 
range of situations in which it provides a 
very good approximation to reality. Very 
many measurements of living things are 
approximately normal, for example in this 
rapeseed field the harvest per square metre, 
and the heights of individual rapeseed plants 
would be approximately normal distributions. 


Objectives 
e Understand the use of a normal distribution to model a continuous random variable, and use 
normal distribution tables. 
e Solve problems concerning a variable X, where X ~ N(y, 0°), including: 
© finding the value of P(X>x,), or a related probability, given the values of x,, 4, 0 
© finding a relationship between x,, 4 and o given the value of 
P(X>x,) ora related probability. 


Before you start 


You should know how to: Skills check: 
1. Solve linear simultaneous equations, 1. Solve the simultaneous equations 
e.g. solve the simultaneous equations 36="—-0.50 
a+1.25b=18.25 (1) 46=1+0.50 
a—0.90b=7.50 (2) 
Subtract (2) from (1) 
2.15b=10.75 
b=5 
Substitute 5 for b in (1) 
a+5x1.25=18.25 
a=12 
2. Substitute values into simple equations, 2. Find p when a=16, b=6, c=2 
e.g. find x, when y=5, a=3, b=2 and given that post 
y - x-a ‘ c 
b x=3 


Substitute for y, a and b: 5=—= 
10=x—-3, x=13. . 


9.1 Continuous probability distributions and 
the normal distribution 


The normal distribution frequently occurs in the real world. 


For example, the heights, or weights, of people often follow an approximate 


normal distribution. 


Frequency density 


Weight (kg) 


Frequency density 


Length of bolt (mm) 


The normal distribution is a type of continuous probability distribution. Did you know? 
This means that: 


Physicists sometimes 
e It relates to a continuous variable (height, weight etc.). refer to this as 
e It describes the probability of this variable taking a particular the Gaussian 
distribution. 


range of values. 


Generally, the normal distribution looks like this: 


oe — 
Q a0 20 30 2 2 so 70 ee 28 100 a 


The normal distribution 


The idealised normal distribution has the following properties: 
e It is symmetric. 


It is infinite in both directions. 


It has a single peak at the centre. 


It is continuous. 

95% of values lie within approximately 2 
standard deviations of the mean. 

99% lie within approximately 3 standard 
deviations of the mean. 


In practice, real-life variables usually do not match all these conditions 
perfectly and the situations will only approximate the normal distribution. 
For example, height and weight cannot take negative values, however a 
zero value may occur a long way out towards the tail of the distribution, 

as in the diagram on the previous page, where the left hand end of the 
graph is already almost invisible at 20. If the value were not towards the tail, 
then the normal distribution would not be a reasonable approximation 

to use in that situation. 


‘The following will enable us to calculate probabilities: 


Area= 1 


_ > 
The total under the normal curve is 1. a b 


The probability that X lies between a and b is the area 
under the curve between x=a and x=b. 


Standardised scores 


The normal distribution allows us to make comparisons between individuals 
in a particular normal population. However, it becomes harder with 
different normal populations — for example, we use different criteria 

to judge a ‘tall man’ than we use for a ‘tall woman. 


1Z' Continuous probability distributions and the normal distribution 


In 2007 De-Fen Yao was 34 years old and 7ft 8" tall, The tallest recorded man was Robert Pershing 
making her the tallest woman ever. But how would Wadlow at 8 ft 11" tall — measured in 1940. 
she rank as a man in terms of height? 


Yelena Isinbayeva is the female 
world record holder in the pole 
vault (5.06 metres) — is her 
achievement more or less 
exceptional than 

Usain Bolt’s world record 

of 9.58 seconds for 100 metres? 


One way to compare different normal populations is to standardise _ This process allows comparison 
the scores — by looking at their distance from the mean, then of performances of pupils in 


haut a + aie different subjects, or of athletes 
dividing by the size of the standard deviation. uncer diferentconditiang: 


However, it is not the only process 
we may use to compare normal 
use the conversion'z =" populations, and events like the 

o decathlon and heptathlon use a 
where p is the mean and a is the standard deviation of points scoring system based ona 
the raw scores. The standardised score is often called the different approach. See ‘Sporting 
z-score. statistics’ on pages 112-113 for 
more on this. 


To find the standardised score, z, from a raw score, x, 


The normal distribution 


Example 1 

In her exams, Alexandra scores 75 in History and 87 in Maths. For the year group as 

a whole, History has a mean score of 63 in the examination with a standard deviation 

of 8, while Maths has a mean of 69 with a standard deviation of 15. Compare Alexandra's 
performance in these two subjects. 


seeeeeeeeeeeeees seeeteceencncececccesscnsscesescsssseesssssasssscessssssssscssssesssssesssssseced 


For History, g-= fo: =15 
For Maths, 7-889 Sila) 


Alexandra's standardised score z is higher in History than in Maths, so there is reason to 
say that her performance is better in History than in Maths. 


If we know corresponding points in two distributions, we should be able to work out 
an unknown mean or standard deviation. 


Example 2 

The mean height of a certain plant (A) is 67 cm, and the heights have a standard deviation 
of 5cm. Another plant (B) has a mean height of 63 cm. 

The same proportion of both types of plant is taller than 77 cm. 

What is the standard deviation of the heights for plant B? 


dd eee eee eee eeseneeeeeeeneeeeeeeeeseeeneeeeeen seen ee enaeeeeeeeeeeeaeseeseesesesesessssseseessessseesseasereoed 


0.1 f(x) 
Plant A 
0.08 
Plant B 
0.04 77cm is 10cm above the mean for A, or 
2 standard deviations. 77 cm is 14cm 
0.02 “above the mean for B and this must also 
7 be 2 standard deviations, so the standard 
v y r r jmruee deviation of the heights for plant B=7. 


Exercise 9.1 


1. p=56 o=7 
a) Find the standardised score for a raw score of 
i) 70 ii) 52.5 
iii) 66.5 iv) 56. 
b) Find the raw score for a standardised score of 
i) 13 ii) 24 
iii) -0.4 iv) 2.0. 


iti Continuous probability distributions and the normal distribution 


2. w=87 o=5 
a) Find the standardised score for a raw score of 
i) 80 ii) 59 iii) 91.3 iv) 86.7. 
b) Find the raw score for a standardised score of 
i) 23. ii) 21 iii) 06 iv) 10. 


3. w=3 o=12 
a) Find the standardised score for a raw score of 
) 1 i) 3 iii) 27. iv) 5.8. 


b) Find the raw score for a standardised score of 
i) 0.7 ii) -1.3 iii) -0.2 iv) 1.8. 


4. a) Ify=64, and a raw score of 76 has a z-score of 2, find o. 


b) Ifo=10, and a raw score of 43 has a z-score of —1.6, find yu. 


5. X has y=48 and o=10. Y has mean 53. 
If the same proportion of X and Y are above 68, find the standard deviation of Y. 


6. X has w=549 and o=34. Y has a standard deviation of 47. 
If the same proportion of X and Y are above 600, find the mean of Y. 


9.2 Standard normal distribution 


e The normal distribution is written as X ~ N (, 0°). 


This means ‘X is distributed as a normal random variable with mean y and variance 0”. 


Since all normal distributions are the same basic shape, we only need to have probabilities 
for one particular case to allow us to calculate probabilities for all cases. 


0.5 7%) 


The standard normal distribution has mean 0 and variance 1. 


The normal distribution 


The variable Z is often used for the standard normal distribution. 
e For the standard normal distribution, Z ~ N(0, 1’) or Z ~ N(0, 1). 


By converting values to the standard normal distribution, 
you can use probability tables to calculate probabilities for any 
normal distribution. 


2 
z| 0 1 2 3 4 5 6 7 8 gf | oe dey oe sa 
0.0 |0.5000 |0.5040 0.5080 0.5120|0.5160 0.5199 0,5239|0.5279 0.5319 0.5359 |4 8 12|16 20 24 |28 32 36 
0.1 |0.5398 |0.5438 0.5478 0.5517|0.5557 0.5596 0,5636|0.5675 0.5714 0.5753 |4 8 12|16 20 24 |28 32 36 
0.2 |0.5793 |0,5832 0.5871 0.5910|0.5948 0.5987 0,6026|0.6064 0.6103 0.6141 |4 8 12 |15 19 23 |27 31 35 
0.3 |0.6179 |0,6217 0.6255 0.6293|0.6331 0.6368 0,6406|0.6443 0.6480 0.6517 |4 7 11 |15 19 22|26 30 34 
0.4 |0.6554 |0,6591 0.6628 0.6664| 0.6700 0.6736 0,6772|0.6808 0.6844 0.6879 |4 7 11 |14 18 22|25 29 32 
0.5 |0.6915 |0,6950 0.6985 0.7019|0.7054 0.7088 0.7123|0.7157 0.7190 0,7224|3 7 10 |14 17 20|24 27 31 
0.6 |0.7257 |0.7291 0.7324 0.7357|0.7389 0.7422 0.7454|0.7486 0.7517 0.7549|3 7 10 |13 16 19 |23 26 29 
0.7 |0.7580 |0.7611 0.7642 0.7673|0.7704 0.7734 0.7764|0.7794 0.7823 0,7852|3 6 9|12 15 18 |21 24 27 
0.8 |0.7881 |0.7910 0.7939 0.7967|0.7995 0.8023 0.8051|0.8078 0.8106 0.8133|3 5 8 {11 14 16|19 22 25 
0.9 |0.8159 |0,8186 0.8212  0.8238|0.8264 0.8289 0.8315|0.8340 0.8365 0.8389 |3 5 8 |10 13 15 |18 20 23 
1.0 |0.8413 |0.8438 0.8461 0.8485]0.8508 0.8531 0.8554|0.8577 0.8599 0.8621|2 5 7] 9 12 14/16 19 21 
1.1 ]0.8643 |0.8665 0.8686 0,8708| 0.8729 0.8749 0.8770|0.8790 0.8810 0.8830|2 4 6 | 8 10 12|14 16 18 
1.2 |0.8849 |0.8869 0.8888 0.8907|0.8925 0.8944 0,8962|0.8980 0.8997 0.9015|2 4 6| 7 9 11} 13 15 17 
1,3 [0.9032 |0,9049 0.9066 0.9082|0.9099 0.9115 0.9131|0.9147 0.9162 0.9177|2 3. 5 | 6 8 10/11 13 14 
The tables give you P(Z<z)=P(Z<z), where z 
is any value from 0 to 3 (beyond that the probabilities 
have become so small they are negligible). 
To find P(Z<1): 
Locate z= 1.0 in the table. 


Go along to column 0, and read off the value below: 
P(ZS1)=0.8413. 


P(Z< 1) = 0.8413 


Because the normal distribution is 
symmetrical, the tables only give you half 


the set of probabilities — the other half 
are identical. 


So to find P(Z<-1): 
P(Z <-1) = 0.1587 
P(ZS-1)=P(Z>1)=1-P(ZS1) 


=1-0.8413 
= 0.1587 


Standard normal distribution 


Because you always use the standard 
normal distribution to find probabilities 
it is worth having a special notation 


for this. f(2) 
We write (z)=P(Z<z). 


So (—k)=1— O(k) by symmetry. 
You often need to use more than one value in the tables: 
P(0.5$z< 1.8)=0(1.8) — (0.5) =0.9641 — 0.6915=0.2726 
Zz 
4 #3 2 =-1 0 1 2 3 4 


P(-1<z<1.3)=@(1.3)—(—1) =0.9032—(1-0.8413) led 


= 0.7445 
‘The tables allow us to give probabilities (correct 
to 4 d.p.) for z-scores up to 2 decimal places, and 
estimates of probabilities for a 3rd decimal place in z. 
zr 
4 3 2 2 0 1 2 3 4 


Example 3 


0.5000 | 0.5040 0.5080 0.5120/0.5160 0.5199 0.5239) 0.5279 0.5319 0.5359 
0.5398 | 0.5438 0.5478 0.5517 | 0.5557 0.5596 0.5636] 0.5675 0.5714 0.5753 
0.5793 | 0.5832 0.5871 0.5910/0.5948 0.5987 0.6026] 0.6064 0.6103 0.6141 
0.6179 | 0.6217 0.6255 0.6293|0.6331 0.6368 0.6406) 0.6443 0.6480 0.6517 
0.6554 | 0.6591 0.6628 0.6664/0.6700 0.6736 0.6772) 0.6808 0.6844 0.6879 
0.6915 | 0.6950 0.6985 0.7019/0.7054 0.7088 0.7123) 0.7157 0.7190 0.7224 20) 24 2 


To find the probability with a z score of 0.247 using the table, we find the row corresponding to 
z=0.2. We then find the column headed ‘4’ (corresponding to the second decimal place) and then 
the column headed ‘7’ in the extra section to the right of the main table. These extra columns on 
the right-hand side give the average extra probability for the 3rd decimal place values at the top of 
the columns. We add the two digits found from the last column to the third decimal place of the 
number found in the previous column, 

i.e. 0.5948 +0.0027=0.5975 
and so, 

P(Z<0.247)=0.5975 


The normal distribution 


Example 4 
If Z ~ N(0, 1’) find 
a) P(Z<1.62) b) P(Z>0.76) cc) P(Z<-1.32) d) P(-1.2<Z<1.7). 


aaececcccescnceccccscsessnccscccessee serecccccececcsccccccscsccscccccsseed 


a) P(Z<1.62)=@(1.62)=0.9474 b) P(Z>0.76)=1—(0.76) 
= 1-0.7764=0.2236 
0.5 4 f(z) 0.5 > f(z) 


c) P(Z<-1.32)=1- (1.32) d) P(-1.2<Z<1.738)=0(1.738)—®(-1.2) 
= 1- 0.9066 = 0(1.738)—(1—(1.2)) 
= 0.0934 = 0.9589—(1—0.8849) 
= 0.8438 


If you have a z-score which is not listed, such as z= E you should 
calculate it to 3 decimal places and use that value. 


B= 1.6666...= 1.667 (3 d.p.) 


(1.667) = 0.9522 

There is a second set of tables provided which give the exact 
z-scores for a limited number of tail probabilities. 

These enable us to work out the z-score corresponding 

to a particular proportion. 


For example, consider the z-value corresponding to the top 5%. -4 3 -2 -- 0 1 2 3 


p | 0.75 090 0.95 | 0.975 0.99 0.995 | 0.9975 0.999 0.9995 


z | 0.674 1.282 1.645 | 1.960 2.326 2.576 | 2.807 3.090 3.291 


sz Standard normal distribution 


The table shows that z= 1.645 (using p=0.95). 


If you need to find a z-score corresponding to an unlisted 
probability, you can use the main table to find an 
approximation. 


For example, consider the z-score with ®(z) =0.5885. 


2 
z 0 1 2 2 4 5 6 7 8 9 Mt 8 i rae 8 7 : 2 
0.0 | 0.5000 | 0.5040 0.5080 0.5120)}0.5160 0.5199 0,5239/0.5279 0.5319 0.5359 |4 8 12 |16 20 24 |28 32 36 
0.1 | 0.5398 | 0.5438 0.5478 0.5517 |0.5557 0.5596 0.5636/0.5675 0.5714 0.5753 |4 8 12 |16 20 24 | 28 32 36 
0.2 | 0.5793 | 0.5832 0.5871 0.5910 }0,.5948 0.5987 0.6026 /|0.6064 0.6103 0.6141 |4 8 12 |15 19 23 |27 31 35 
0.3 | 0.6179 |0.6217 0.6255 0.6293 |0.6331 0.6368 0.6406 /|0.6443 0.6480 0.6517 |4 7 11 |15 19 22 |26 30 34 
0.4 | 0.6554 |0.6591 0.6628 0.6664 0.6700 0.6736 0.6772/|0.6808 0.6844 0.6879 |4 7 11 |14 18 22 |25 29 32 


Even using the third decimal place corrections, there is no z-score which 
gives exactly 0.5885. The nearest we can get is 0.5886 when z is 0.224. 


Exercise 9.2 
All of these questions relate to the standard normal distribution, i.e. Z ~ N(0, 1°). 


1. Find a) ©(-0.06) b) (2.63) ©) (4) d) 0(2.5)-0(1.2) e) (1.43) — &(-1.03). 
2. Calculate a) P(Z<1.08) b) P(Z>-0.3) —c) P(Z<-0.72)_—d) o(3<z<2), 


3. Find the probabilities shown by the shaded areas. 
a) 0.5 4f(x) b) 


c) 0.5 4f(x) 
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4. Find 
a) P(-1.8<Z<1.8) 


b) P(-0.72 <Z<0.72) 


c) P(Z<-2.8 or Z>2.1) 
d) P(Z<14 or Z>1.7). 


5. Calculate the z-scores which cut off 
a) the top 10% 
b) the top 0.5% 
c) the bottom 2.5% 
d) the bottom 20% 
e) the top 6% 
f) the bottom 1.7%. 


9.3, Calculating probabilities for the N(y, o*) distribution 


All normal distributions are essentially the same shape — they may 
have a different centre, or be more peaked, but they can all be 
standardised to the N(0, 1) distribution. 


0.1 9 f(x) 


m8 N(25, 25) 


0.06 


0.04 
N(35, 100) 


0.02 


For a distribution X ~ N(u, 0”), we can find the probability 
of X taking a range of values through the following. 


e First calculate z by using z =k 


e Then use the table of probabilities to find ®(z). 
e@ Determine the probability required, referring to a sketch. 


Hay Calculating probabilities for the N(w, 0”) distribution 


Example 5 
X~N(2, 5’). 
Find a) P(X<7) b) P(X>11) c) P(\x|<3) 
d) P(\x—2|<6) e) xsuch that P(X>x)=0.05. 
a) P(X<7) b) P(X>11) 
zoho z-24=1 gaat =z=t= 2-18 
P(X<7)=P(Z<1)=(1)=0.8413 P(X>11)=P(Z>1.8)=1— O(L.8) 
= 1—-0.9641=0.0359 


0.1 4 f(x) 


20-15 -20 -15 -10 -5 0 
P(|X|<3) d) P(|x-2|<6) 


|x] <3 =>-3<x<3 |x—2|<6=>-4<x<8 


= =4—2 
qeS=-1 z= i 4 == 12 Z, : 
P(—3<X < 3)=P(—1< Z <0.2) P(-4 < X < 8)=P(-1.2 < Z< 1.2)=@(1.2) 
=(0.2)—® (-1)=0.5793 —(1 —0.8413)=0.4206 —@(-1.2)=0.8849 —(1—0.8849)=0.7698 


0.1 >f(x) peti 


— 


r 
-20 -15 -10 -5 


r 71 
-20 -15 -10 -5 
x such that P(X>x)=0.05 

(z)=0.95 = z=1.645 Use the table of 

percentage points to find 

z=1.645. 


X=p + Zo => x=24+1.645X5=10.225 
so x=10.2 (3 s.f.) 
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If we are given enough information, we can work 
out y or a or both. 


Example 6 (z)=0.95 = z=1.645 
X is normally distributed such that X ~ N(u, 36). } x="+zo => 159.3="+1.645x6 
Also, it is known that P(X>159.3)=0.05, = p=159.3 — 1.645 x6= 149.43 
Calculate the value of y correct to 1 decimal so 4=149.4 (1 d.p.) 
place. oat 
0.08 
0.06 
0,04 
0.02 


Example 7 $ O(z)=0.96 > z=1.751 
X ~ N(37.1, & ), P(X>51.3)=0.04. x=yp+zo => 51.3=37.14+1.751 xo 
Calculate the value of o. as Alesse = 8.1096. 
1 : ae 
=8.11 (3s) 

0.1 7 f(x) 

0,08 

0.06 

0,04 

0.02 


His3y Calculating probabilities for the N(y, 0”) distribution 


Example 8 
X ~N(u, 0°), P(X <37) = 0.1, P(X > 49.3) = 0.2. 
Calculate the values of » and o. 


O(z,)=0.1 = z,=-1.282 


O(z,)=0.8 => z,=0.842 


Using x=p+zo > 
49.3="+0.842xX0 
37=y"-1.282xo0 


Subtracting > 12.3=2.1240 


Bs 
US Ter 


#=37+1.282x5.7909...=44.424... 
= 44.4 (3 sf.) 


=5.793...=5.79 (3 s.f.) 


Exercise 9.3 
1. X~N(47,5°). 
Find a) P(X<56) b) P(X>51) c) P(X<42). 
2. X~N(32, 16) 
Find a) P(X<30) b) P(X>25) c) P(X<30.3). 
3. X~N(4,9) 
Find a) P(X<10) b) P(X>5) c) P(|X|<3). 
4. X~N(1750, 165°) 
Find a) P(X<1750) b) P(X>1780)_ c) P(|X—1750|<165). 
5. X~N(0, 20) 
Find a) P(X<10) b) P(X>2.5) c) P(-3<X<5). 
6. X~N(25, 16) 
Find a) xsuch that P(X>x)=0.05 b) ysuch that P(X < y) = 0.6. 
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7. X~N(83.2, 4.57) 
Find a) xsuch that P(X >x)=0.01 
b) ysuch that P(X<y)=0.3 
c) zsuch that P(X < z)=0.78. 
8. X~N(0, 15) 
Find a) xsuch that P(|X|<x)=0.8 
b) ysuch that P(|X|>y)=0.6. 


9. X~N(, 5°), P(X>23.4)=0.05. 
Calculate the value of p. 


10. X ~ N(42, 0°), P(X >48,3)=0.01. 
Calculate the value of o. 


11. X ~ N(u, 16), P(X>—3)=0.98. 
Calculate the value of y. 


12. X ~ N(186, 0”), P(X<193)=0.92. 
Calculate the value of o. 


13. X~ N(-32, 0”), P(X<-31.3)=0.9. 
Calculate the value of o. 


14. X ~N(, 0), P(X<27)=0.2, P(X>35)=0.3. 
Calculate the values of y and o. 


15. X ~N(, 0”), P(X<78)=0.6, P(X>89)=0.2. 
Calculate the values of v and o. 


16. X~N(u, 0°), P(X<2.1) =0.6, P(X<2.7)=0.7. 


Calculate the values of u and o. 


17. X ~N(u, 6”), P(X>1056)=0.6, P(X>1132)=0.2. 


Calculate the values of y and o. 


18. X ~ N(u, 0°), P(X<47.3)=0.5, P(X>52)=0.2. 


Calculate the values of u and o. 


ty Calculating probabilities for the N(y, 0”) distribution 


9.4 Modelling with the normal distribution 


The introduction to this chapter refers to the real-life uses of the normal 
distribution. This section ties together the techniques covered so far, 
describes how the normal distribution is used in practice and how to 
evaluate whether it is an adequate model for a given situation. 

Consider the characteristics of the normal distribution. 

It is symmetric. 

It is infinite in both directions. 


It has a single peak in the centre. 


It is continuous. 


Histograms of very many observed populations of naturally occurring 
phenomena have roughly the bell shape which defines the normal 
distribution, but many of those populations cannot possibly have an exact 
normal distribution. It is a judgement call as to when the use of it is a useful 
model because it is a reasonable model, i.e. it is a good approximation to 
reality. Reasons it may not be exact include the following: 


e Some distributions are skewed. If you look back at Chapter 3 where 
you looked at representing lots of data sets graphically, performance in 
sporting events tends to be skewed — weak performances have longer tails 
than is possible for the best performances; average starting salaries for 
different levels of education commonly have a positive skew. The normal 
distribution may still be a reasonable model for populations which are not 
perfectly symmetric, but it should not be grossly skewed. 


e Many (most) physical measurements do not take negative values, so 
the distribution is truncated at zero — and will not be infinite in both 
directions. However, where the mean is much greater than the standard 
deviation, the effect of this is negligible. 

e Counts of occurrences, and any monetary context, are actually 
discrete. However, where the number of possible outcomes is large, the 
(continuous) normal distribution will provide a reasonable model — 
provided the other conditions above are also reasonably met. 
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Example 9 
The lengths of steel girders produced in a factory are normally Did you know? 


distributed with a mean length of 12.5m and a variance of 0.0004 m’. In real life, we often 
Girders need to be between 12.47 and 12.53 metres to be used in encounter numbers 
construction. wile lappedt 
‘ unnecessarily awkward. 
a) Find the proportion of girders which cannot be used for construction. _ However, having steel 
Girders are extremely expensive to produce, and the company is not girders almost exactly 
the same length is vitally 


happy with this level of wastage. A new machine is installed which A 
i s y laseaen P important for the safety 
reduces the variance of the production to 0.0002 m’ while maintaining of the building. 
a normal distribution with the same mean as before. 
b) Find the proportion of girders produced by the new machine 
_Which cannot be used for construction. 


Ay The standard deviation ie: ioredtnata is 0.02 m, or 2cm. 305 fix) 
X ~ N(12.5, 0.02?) 
, = AT=125 ys, 7, = 1253-125 1 5 eth 
0.02 0.02 
P(12.47<X<12.53)=P(-1.5<Z<1.5)=@(1.5)—@(-1.5) 10 
= 0.9332—(1—0.9332) =0.8664 4 
13.4% of the girders cannot be used. [eB ee ie Gh ably 
b) The standard deviation of production is now 0.01 m, or 1 cm. cite 
X ~ N(12.5, 0.012) 
= 1247-125 __4. 7 1253-125 _ 304 
Z 0.01 oa 0.01 : 
P(12.47 <X<12.53)=P(-3<Z<3)=0(3)—®(-3) m 
= 0,9987—(1—0,9987)=0.9974 
104 


Now only 0.26% of the girders from the new machine cannot 
be used. x 
123 124 125 126 127 


‘ym Modelling with the normal distribution 


If we have historical exam performance data, we can predict the 
proportion of students that will get a particular grade. 


Example 10 


In an examination the marks are normally distributed with mean 60.7 and a standard 
deviation of 12.3. 


a) A candidate needs a mark of at least 40 to pass. 
Approximately what percentage of candidates fail? 
b) The board awards distinctions to the best 10% of the candidates. 


What is the least mark a candidate will need to get a distinction? 


c) The list of passing candidates is published the day before the list of 
distinctions. What is the probability that a candidate who has passed will 
have got a distinction? 


d) Evaluate the assumption that the marks are normally distributed in this context. 


a) X~N(60.7, 12.3?) 0,04 5 f(x) 
z = 40=80.7 — _) 6829... 
12.3 


P(fail)=P(X<40)=P(Z<-1.6829...) 
=1-—9(1.683)=1—0.9538=0.0462 


so approximately 5% of the candidates fail. + ‘- a 


10 20 30 40 80 90 100 


b) X ~ N(60.7, 12.37) 
(z)=0.9 = z=1.2816 
x= 60.7 + 1.2816 x 12.3=76.46368 


so a candidate needs to score at least 77 to get a distinction. 


¢) This is a conditional probability, but a special sort — 
the set of candidates who get a distinction are a subset 
of the candidates who pass the exam. 
Therefore P(D M Pass)=P(D), and 


> 2(D)) on een 
P(D| Pass) = 5 pass = gi9338 = 0-1048--=10.5% 


d) The distribution given can only be an approximation, since 
marks will be non-negative integer values only. The normal, 
with the same mean and variance as the historical data, will 
often be a good model for the distribution of the marks. 
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Companies producing packaged foods are usually 
required by law to state the estimated weight of the 
contents. They need to be able to show that it is 

rare for their products to contain less than what it 

says on the packet. However, all production 

processes are subject to variation. Standard deviation 

is a measure of the variability of a distribution. In 
manufacturing, this is usually dependent on the 

quality of the machinery used, where the mean can be 
adjusted by setting controls — so it is reasonable that the 
standard deviation may be known even if the mean is not. 


Examples 9, 10, 11 and 12 are all contexts in which negative values of the 
variable would be impossible in context. In each case, the specified normal 
distribution would give a positive (although tiny) probability. This is why 
we talk about modelling a situation by the normal distribution — it is a 
useful description even though it is not a perfect description. 


Example 11 


A machine pours melted chocolate into moulds. 

The standard deviation of the amount it pours is 0.7 grams, and the mean amount 
can be set on the machine. 

The amount poured may be assumed to follow a normal distribution. 

The machine is to produce bars whose label says 50 g of chocolate. 

The company’s lawyers want to have no more than 0.5% of bars containing 

less than the advertised weight. 

What should the mean be set at? 


deeeeecescccecereccrcecescssesoece deeccccccnsecvecccccescnsesocesesnscnsccsccnsesccencccsscssesscescesoed 


0.5% is equivalent to a probability of 0.005. 0.6 5 f(x) 
(z)=0.005 = z=-2.5758 05 
x=u"+zs => 50=y -2.5758 x 0.7 0.4 
=> 1=50+2.5758 x 0.7=51.80306=51.8 (3 s.f.) Be 
So the mean should be set at 51.8 g. 02 
0.1 


48 49 50 51 52 53 54 55 


Modelling with the normal distribution 


Example 12 


An airline does a survey of the weights of adult passengers travelling on its flights. 

It finds that 5% weigh more than 84.3 kg and 2% weigh less than 57.2kg. 

Assuming that the weights of adult passengers on its flights are normally distributed, 
find the mean and standard deviation of the weights. 


Oz, 2.054 (z,) =0.95 = z, = 1.6449 
using x=y+zo => 0.1 4) 
84.3 =u+1.6449 x o 0.08 
57.2 ="—-2.054 xo 0.06 


Subtracting => 27.1=3.69890 Oi 


er Vale = 0.02 2% = 
Ora 7.3265... = 7.33 (3 s.f.) F x 


= 57.2+2.054 x 7.3265... = 72.2486... = 72.2 G sf.) 40 45 50 55 60 65 70 75 80 85 90 95 100 


Exercise 9.4 


1. IQ scores are normally distributed and are measured on a scale which has a 
mean of 100 and a standard deviation of 15. Find the IQ, X, which is exceeded 
by only 5% of the population. 


2. The lengths, L metres, that Philippe jumps in the triple jump event may be 
modelled by a normal distribution with mean 15.85 m and variance 0.36 m’. 
a) Fora jump chosen at random, find the probability that he jumps at least 16.2 m. 
In a competition, there is a qualifying distance of 16.2 m. 
b) Find the probability that he jumps over 16.5 m, given that he makes the qualifying 
distance with the jump. 
3. The diameter of pistons produced in a factory are Dcm, where D ~ N(u, 0.127). 
a) Given that 5% of the pistons have a diameter less than 13.20cm, show that 
w= 13.4. 
‘The tolerance specified for the pistons is that the diameter needs to be at least 
13.35 cm and not more than 13.5cm. 
b) What proportion of the production in the factory meets these 
tolerance limits? 
Three pistons are chosen at random. 
c) What is the probability that none of them meet the tolerance limits? 
d) Evaluate the assumption that the diameters of the pistons are normally distributed 
in this context. 
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4. The time, T minutes, taken for the employees of a large firm to travel 
to work can be assumed to be normally distributed. Of the employees, 
10% take at least 40 minutes to travel and 0.5% take less than 8 
minutes. 

a) Find the mean and standard deviation of T. 
b) Evaluate the assumption that the travel times are normally 
distributed in this context. 

5. The operational life of batteries produced by one manufacturer (A) is 
normally distributed with mean 43 hours and standard deviation 4 
hours. 

a) Find the probability that a randomly chosen battery operates for 
more than 40 hours. 

b) What length of operational life is exceeded by 10% of batteries 
from this manufacturer? 

A rival manufacturer wants to be able to claim that 95% of its batteries 

last longer than the average produced by manufacturer A. Their 

process also has a standard deviation of 4 hours. 

c) What mean will their manufacturing process need to have to be 
able to make this claim? 


6. A machine fills tins with an amount of liquid which is normally 
distributed, with mean 330 ml and standard deviation 8 ml. 
a) Find the probability that a tin contains less than 320 ml. 
b) Find the probability that a tin contains between 320 and 345 ml. 
c) What volume is exceeded by 5% of the tins? 


Another machine also distributes the same liquid with amounts which 

follow a normal distribution. 

d) Ifthe standard deviation is still 8 ml, what should the mean 
amount be set to if only 5% is to be below 320 ml? 


e) Ifthe mean is to stay at 330 ml, and only 5% is to be below 320 ml, 
what would the standard deviation have to be? 


7. The weights of eggs from one farm are normally distributed with mean 
53 g and standard deviation 4g. 


a) What is the probability that an egg from this farm weighs more 
than 56 grams? 

Eggs which weigh less than 48 grams are removed and not sold. 

b) What is the probability that an egg sold from this farm weighs 
more than 56 grams? 


Modelling with the normal distribution 


8. A company employs a large number of administrative staff. When 
the company wants to employ new staff, candidates are given a 
standard task to complete, and the time they take to complete the 
task is recorded. It is observed that the times taken by candidates are 
normally distributed with mean 360 seconds and standard deviation 
75 seconds. 


a) i) What proportion of the applicants take longer than 
450 seconds? 


ii) What proportion of the applicants take between 210 seconds 
and 450 seconds? 
iii) What time is exceeded by 5% of the candidates? 
Candidates who take longer than 450 seconds are automatically 
rejected and those who take less than 210 seconds are automatically 
accepted. The remainder are interviewed. 
b) What is the median time taken by those applicants who are 
interviewed? 


9. The volume of oil poured into cans in a production process is known 
to have a standard deviation of 2.6 ml. 
Of the cans, 22% contain less than 660 ml. 
a) Calculate the mean volume of oil in the cans. 


b) Calculate the proportion of cans which contain less than 650 ml. 


10. The screws produced by a company have mean length 17.9 mm and 

variance 0.04mm’. 

a) What proportion of the screws are less than 17.5 mm? 

Any screws which are less than 17.5mm or greater than 18.4mm 

cannot be sold. 

b) Ina batch of 1000 screws, how many would you expect to be 
rejected? 

The mean length can be adjusted by a setting on the machine. 

c) At what should the mean be set in order to maximise the 
proportion of screws satisfying the specifications? 

If a screw is too short it has to be scrapped, but if it is too long it can be 

filed to make it usable. The mean is set at 17.9 mm. 

d) A screw is chosen at random and found to be long enough not to 
be scrapped. Evaluate the probability that it will need to be filed 
before it can be sold. 
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Summary exercise 9 


EXAM-STYLE QUESTION 


1. 


IQs are measured on a scale with a mean of 

100 and standard deviation 15. Assuming 

that IQs can be modelled by a normal 

random variable, find 

a) P(Y>130) 

b) P(73<Y<91) 

c) the value of k, to 1 decimal place, such 
that P(Y<k)=0.2. 


The performances in the 100 metre sprint 
of a group of decathletes are modelled by 

a normal distribution with mean 11.46 
seconds and a standard deviation of 0.32 
seconds. The performances of this group of 
decathletes in the shot put are modelled by a 
normal distribution with mean 11.62m and 
standard deviation 0.73 m. 


a) Find the probability that a randomly 
chosen athlete runs the 100 metre sprint 
faster than 10.8 seconds. 

Find the probability that a randomly 
chosen athlete puts the shot further 
than 12.8m. 

Assuming that for these decathletes 

the performances in the two events are 
independent, find the probability that 

a randomly chosen athlete runs the 

100 metre sprint faster than 10.8 seconds 
and puts the shot further than 12.8m. 
Evaluate the assumption that the 
performances in the two events are 
independent. 


b) 


c) 


d) 


EXAM-STYLE QUESTIONS 
3. Bottles of spring water have a stated volume 


of 330 ml. The bottles are filled by a machine, 
which dispenses a volume which is normally 
distributed with mean 335 ml and standard 
deviation 5 ml. 


Summary exercise 9 


a) Find the probability of a bottle 
containing less than the stated volume. 

The bottles have a capacity of 345ml. Any 

time that the machine dispenses more than 

this, the extra spring water spills out. 

b) Find the probability of a bottle 
containing less than the stated volume, 
given that it didn’t overflow. 

A new machine is installed. Only 0.5% of 

bottles overflow when the mean amount 

dispensed by the new machine is set to 335 ml. 

c) Find the standard deviation of the 
amount of spring water dispensed by the 
new machine. 


A random variable X has a normal 
distribution. 


a) Describe two features of the distribution 
of X. 

A company produces batteries which have 

lifespans that are normally distributed. 

Only 2% of the batteries have a lifespan 

less than 230 hours and 5% have a lifespan 

greater than 340 hours. 

b) Determine the mean and standard 
deviation of the lifespans of the batteries. 

The company gives a warranty of 250 hours 

on the batteries. They make a profit of 

$750 on each battery that they sell. Replacing 

a battery under warranty costs the company 

$1250. 

c) Find the average profit they make on the 
sale of 100 batteries after any replacements 
under warranty have been made. 

The random variable X is normally 

distributed with mean 253 and variance 121. 

Find 

a) P(X<240) 

b) P(245<X<275). 


7. 


It is known that P(a<X)=0.13. 
c) Find the value of a. 


The length of time it takes an examiner to 
mark an examination script may be modelled 
by a normal distribution with mean 8 minutes 
and standard deviation 90 seconds. 


(a) Find the time, tf minutes, such that 
one script in six will take the examiner 
longer than f minutes to mark. 


(b) Evaluate the assumption that the 
marking times are normally distributed 
in this context. 


An examination in Statistics consists of a 

written paper and a project. 

Marks for the written paper, E, may be 

modelled by a normal distribution with 

mean 62 and standard deviation 9. 

Marks for the project, F, may be modelled 

by a normal distribution with mean 70 and 

standard deviation 6. 

a) Find P(E>80). 

b) Find p such that P(F>p)=P(E>80). 

A distinction in the examination requires 

at least 75 in both the written paper and the 

project. 

c) Find the probability that a candidate 
gets a distinction, assuming that the 
performance on the written paper and 
the project are independent of one 
another. 

d) Comment on the assumption of 

independence in part (c). 


: EXAM-STYLE QUESTION 


8. 


The random variable X ~ N(u, 07). 
It is known that 
P(X<69)=0.0228 and P(X > 95)=0.1056. 
a) i) Show that the value of cis 8. 
ii) Find the value of wv. 


b) Find P(71<X<81). 


An electronic component has a useful 
lifespan which can be modelled by a normal 
distribution with mean 8000 hours and a 
standard deviation of 400 hours. 

a) Calculate the probability that a randomly 
selected component will last 


i) less than 7700 hours 
ii) between 7500 and 8300 hours 


iii) at least a year if it is installed on 
January 1st 2007. 


he components cost the manufacturer 

215 to produce. The manufacturer offers an 
ptional guarantee at an extra cost of $50. 

he terms of the guarantee are that the 
manufacturer will replace the component 
free of charge if its useful lifespan is less than 
7000 hours, and at a cost of $75 to the 
customer if its lifespan is between 7000 

and 7500 hours. 

b) Calculate the expected profit or loss on 
each guarantee sold by the manufacturer. 


a 


° 


EXAM-STYLE QUESTIONS 


10. IQs are measured on a scale with a mean of 


11. 


100 and a standard deviation of 15. 


a) Assuming that IQs can be modelled by a 

normal random variable Y, find 

i) P(Y>120) 

ii) P(75<Y<94). 

b) What is the lowest IQ that places a 
person in the top 0.5% of the population? 


Bottles of milk have a stated volume of 

500 ml. The bottles are filled by a machine 

which dispenses a volume which is normally 

distributed with mean 505 ml and standard 

deviation 8 ml. 

i) Find the probability of a bottle containing 
less than the stated volume. 


The normal distribution 


The bottles have a capacity of 525ml. Any 
time that the machine dispenses more than 
this, the extra milk spills out. 


12. The height of coffee plants on a smallholding 
in Kenya is normally distributed with mean 
3.2m and standard deviation om. The 
proportion of plants that have a height greater 
than 3.4m is 1.5%. Find the value of o. 


ii) Find the probability that a bottle contains 
less than the stated volume, given that it 


didn't overflow. 13. A random variable X is normally distributed 


with mean y and standard deviation o. 
It is given that P(X <310) = 0.1 and that 
P(X>340) =0.05. 


Find y and o. 


A new machine is installed. Only 0.1% of 
bottles overflow when the mean amount 
dispensed by the new machine is set to 505ml. 


iii) Find the standard deviation of the amount 
of milk dispensed by the new machine. 


Chapter summary 
Continuous probability distributions and the normal distribution 


e The normal distribution is continuous, symmetric, infinite in both directions and has a single 
peak at the centre. 


© 95% of values lie within approximately 2 
standard deviations of the mean; 


© 99% lie within approximately 
3 standard deviations of the mean. , 
e To find the standardised score, z, from a raw score, x, use the <— 95% —> 
conversion z = a where y# is the mean and ais the 
standard deviation of the raw scores. x=,1+zo can be used to 
convert standardised scores back to raw scores. 


e The probability tables for the N(0, 1) distribution give the cumulative probability D(z) for 
non-negative values of z. The symmetry of the distribution allows the values of ®(z) for 
negative z to be deduced from these. All probabilities can then be worked out using one or 
two values from the tables. 


Standard normal distribution 
e The normal distribution is written as X ~ N(u, ©). 


e ‘The standard normal distribution has mean 0 and variance 1. 


Calculating probabilities for the N(y, o”) distribution 


e Calculating probabilities for the N(y, o°) distribution is done by standardising the scores and 
using the standard normal distribution tables. 

e Calculating an unknown mean and/or standard deviation is done by constructing one or 
two equations involving the unknowns from the probability information provided and then 
solving for the unknown(s). 


Summary exercise 9 


Tables of the normal distribution 


If Z has a normal distribution with mean 0 and variance | then, 

for each value of z, the table gives the values of P(z), where 
O(z)=P(Z<z). 

For negative values of z use O(—z) = 1 — ®(z). 


2) 


z|o 1 2 3 4 5 6 7 8 | 9-2 4 & ex on ee 
0.0 {0.5000 |0.5040 0.5080 0.5120 |0.5160 0.5199 0.5239 |0.5279 0.5319 0.5359 |4 8 12 |16 20 24 |28 32 36 
0.1 {0.5398 |0.5438 0.5478 0.5517 |0.5557 0.5596 0.5636 |0.5675 0.5714 0.5753 |4 8 12 |16 20 24 |28 32 36 
0.2. |0,5793 | 0.5832 0.5871 0.5910 0.5948 0.5987 0.6026 |0.6064 0.6103 06141 |4 8 12 |15 19 23 |27 31 35 
0.3 |0.6179 | 0.6217 0.6255 0.6293 |0.6331 0.6368 0.6406 |0.6443 0.6480 0.6517 |4 7 11 |15 19 22 |26 30 34 
0.4 {0.6554 |0.6591 0.6628 0.6664 |0.6700 0.6736 0.6772 |0.6808 0.6844 0.6879 |4 7 11 |14 18 22 |25 29 32 
0.5 |0,6915 | 0.6950 0.6985 0.7019 |0.7054 0.7088 0.7123 |0.7157 0.7190 0.7224 |3 7 10 |14 17 20 |24 27 31 
0.6 |0.7257 |0.7291 0.7324 0.7357 |0.7389 0.7422 0.7454 | 0.7486 0.7517 0.7549 |3 7 10 |13 16 19 |23 26 29 
0.7 0.7580 |0.7611 0.7642 0.7673 |0.7704 0.7734 0.7764 |0.7794 0.7823 0.7852 |3 6 12 15 18 |21 24 27 
0.8 |0.7881 |0.7910 0.7939 0.7967 |0.7995 0.8023 0.8051 |0.8078 0.8106 0.8133 |3 5 1H 14 16 |19 22 25 
0.9 {0.8159 |0.8186 0.8212 0.8238 |0.8264 0.8289 0.8315 |0.8340 0.8365 0.8389 |3 5 8 |10 13 15 |18 20 23 
1.0 |0.8413 |0.8438 0.8461 0.8485 |0.8508 0.8531 0.8554 |0.8577 0.8599 0.8621 |2 5 7/9 12 14 |16 19 21 
1.1 0.8643 | 0.8665 0.8686 0.8708 |0.8729 0.8749 0.8770 |0.8790 0.8810 0.8830 |2 4 6| 8 10 12 |14 16 18 
1.2 [0.8849 | 0.8869 0.8888 0.8907 |0.8925 0.8944 0.8962 |0.8980 0.8997 0.9015 |2 4 6|7 9 11 |13 15 17 
1.3. |0.9032|0.9049 0,9066 0.9082 |0.9099 0.9115 0.9131 |0.9147 0.9162 0.9177 |2 3 5|6 8 10 |11 13 14 
1.4 |0.9192 | 0.9207 0.9222 0.9236 |0.9251 0.9265 0.9279 |0.9292 0.9306 0.9319 |1 3 4/6 7 8 |10 11 13 
1,5. |0.9332|0.9345 0.9357 0.9370 |0.9382 0.9394 0.9406 |0.9418 0.9429 0.9441 |1 2 4/5 6 7/8 10 11 
1.6 |0.9452 | 0.9463 0.9474 0.9484 |0.9495 0.9505 0.9515 |0.9525 0.9535 0.9545 |1 3/4 5 6/7 8 9 
1,7 |0.9554 |0.9564 0.9573 0.9582 |0.9591 0.9599 0.9608 |0.9616 0.9625 0.9633 |1 2 3|4 4 5/6 7 8 
1.8 |0.9641 | 0.9649 0.9656 0.9664 |0.9671 0.9678 0.9686 |0.9693 0.9699 0.9706 |1 1 2|3 4 4/5 6 6 
1.9 |0.9713|0.9719 0.9726 0.9732 |0.9738 0.9744 0.9750 |0.9756 0.9761 09767 |1 1 2/2 3 4/4 5 5 
2.0 |0.9772|0.9778 0.9783 0.9788 |0.9793 0.9798 0.9803 |0.9808 0.9812 0.9817]0 1 1/2 2 3/3 4 4 
2.1 |0,9821 |0.9826 0.9830 0.9834 |0.9838 0.9842 0.9846 |0.9850 0.9854 0.9857 |0 1 1/2 2 2/3 3 4 
2.2 |0.9861 |0.9864 0.9868 0.9871 |0.9875 0.9878 0.9881 |0.9884 0.9887 0.9890 |0 1 1/1 2 2/2 3 3 
2.3 |0.9893 |0.9896 0.9898 0.9901 |0.9904 0.9906 0.9909 |0.9911 0.9913 09916 |0 1 1/1 1 2/2 2 2 
2.4 |0.9918 |0.9920 0.9922 0.9925 |0.9927 0.9929 0.9931 |0.9932 0.9934 0.9936 /0 0 1/1 1 1/1 2 2 
2.5 |0.9938 |0.9940 0.9941 0.9943 |0.9945 0.9946 0.9948 |0,.9949 0.9951 0.99520 0 O}1 1 1/1 1 1 
2.6 |0.9953 | 0.9955 0.9956 0.9957 |0,9959 0.9960 0.9961 |0.9962 0.9963 09964 |0 0 0/0 1 1/1 1 1 
2.7 |0.9965 |0.9966 0.9967 0.9968 |0.9969 0.9970 0.9971 |0,.9972 0.9973 0.99740 0 0|0 0 1/1 1 1 
2.8 |0.9974|0.9975 0.9976 0.9977 |0.9977 0.9978 0.9979 |0,9979 0.9980 0.9981 |0 0 0|0 0 0 ol 
2.9 {0.9981 | 0.9982 0.9982 0.9983 |0,9984 0.9984 0.9985 |0.9985 0.9986 0.9986 |0 0 010 0 0 oo 


Critical values for the normal distribution 
If Z has a normal distribution with mean 0 and variance | then, for each value 
of p, the table gives the value of z such that 


P(ZSz)=p. 
Pp 075 0.90 095 | 0975 0.99 0.995 | 0.9975 0.999 0.9995 
z 0.674 1.282 :1.645 | 1960 2.326 2.576 | 2.807 3.090 3.291 


The normal distribution 


171 


10 The normal approximation to 
the binomial distribution 


This is a particular case of a very important 
theorem, the central limit theorem, which says 
that the distribution of means of samples of 
size n from any underlying distribution will 
be approximately normal if 1 is large enough. 
The fundamental stability and consistency of 
the behaviour of large numbers of individually 
random events is the basis for some financial 
| products and of insurance markets. 


Objectives 

e Recall conditions under which the normal distribution can be used as an approximation 
to the binomial distribution (1 large enough to ensure that np > 5 and ng > 5), and use this 
approximation, with a continuity correction, in solving problems. 


Before you start 


You should know how to: Skills check: 
1. Find the mean and variance of a binomial 1. IfX ~ B(8, 0.3), find the mean and 
distribution, e.g. if X ~ B(12, 0.6), find the variance of X. 


mean and variance of X. 


Mean = 12 x 0.6 =7.2 
Variance = 12 x 0.6 x 0.4 = 2.88 


2. Calculate probabilities using the normal 2. X ~N(4, 9). Find 
distribution, e.g. a) P(X < 10) 
X ~ N(30, 16). b) P(X>5) 
Find a) P(X<20) b) P(X<35). ©) P(IX|<3). 


a) P(X<20)= p(z< 2 22=-2,5) 
=P(Z>2.5)=1—0.9938=0.0062 


b) P(X<35)= oz <2 1.25) = 0.8944 


10.1 Normal shape of some binomial distributions 


Discrete random variables take only particular values, each with its own probability. 
Continuous random variables take values over an interval, and probabilities 
are defined for ranges of values rather than individual values. 


When there are a large number of possible values for a discrete distribution 
there can be a lot of calculations involved. 


However, if we consider the graphs of some of the distributions, their shape 

looks similar to the normal distribution and suggests that we could use the 
corresponding normal distribution to reduce the work in calculating probabilities. 
There is a trade-off between only getting an approximation to the probability 

and doing much less work. 


n= 20, p= 0.25 


012345 67 8 9 1011121314151617181920 
Number of successes 


In Section 7.3 we saw the binomial distributions of B(20, 0.25) and 
B(20, 0.9) (reproduced below). B(20, 0.25) looks approximately 
normal, but B(20, 0.9) does not. Hence the shape being approximately 
normal depends not only on n and p but also on n and q. 


n= 20, p= 0.25 n=20, p-0.9 
0.35 0.35 9 
03 03 4 
2 0.25 + 0.25 4 - 
B 02 0.2 
2 0.15 0.15 + —HH}— 
0.14 0.1 
0.05 0.05 
0 0 


a a 
012345 67 8 9 1011121314151617181920 012345 67 8 9 1011121314151617181920 
Number of successes Number of successes 


The normal approximation to the binomial distribution 


Exercise 10.1 
1. Find the probabilities of all possible values for B(6, 0.5) and represent 
these graphically as above (i.e. showing gaps between bars). 


2. Find the probabilities of all possible values for B(6, 0.1) and represent 
these graphically as above. 


10.2 Continuity correction 


In the diagrams in Section 10.1 the discrete probabilities are shown as bars with gaps 
between them. In fact, the bars really should have zero width since the probability 
occurs only at the integer value, in which case the diagrams would not look so much 
like the normal. 


If we take ‘7’ in the discrete distribution to be represented by the interval (6.5, 7.5), 
which are the values which round to 7, then the graphs look like: 


n= 20, p=0.5 


0.2 
2 0.15 

0.1 
0.05 


Probabilitity 


2 4 6 8 10 12 14 16 18 20 
Number of successes 


The resemblance to the normal distribution is now even stronger. 


When the normal is used to approximate the binomial (or any other 


distribution that takes only integer values) we must use a continuity 
correction. 


If we want P(X > 7), we must use a cut-off for the normal at 7.5. 

For P(X < 7) or P(X = 7) the cut-off would be at 6.5. 

‘The easiest way to determine whether it is at +0.5 or at —0.5 is to consider 
which two integers are to be separated. 


Hence, P(X=x)=P(x-0.5<Y<x+0.5), 
where Y is the approximating normal random variable. 


Continuity correction 


Example 1 


Let X ~ B(n, p) and Y ~ N(np, npq), where n, p satisfy the conditions needed for 

Y to be used as an approximation for X, and q = 1 —p. 

Write down the probability you need to calculate for Y (including the continuity correction) 
as the approximation for each of the following probabilities for X. 


P(X<15) b) P(X>12) c) P(X<17)_— d) P(I2<X< 15). 


P(Y < 14.5) Separates up to 14 from 15 and up. 


P(Y > 12.5) Separates up to 12 from 13 and up. 
P(Y < 17.5) Separates up to 17 from 18 and up. 


P(12.5 < Y< 15.5) The integers which satisfy this are 13, 14 and 15. 


Exercise 10.2 


1. Let X ~ B(n, p) and Y ~ N(np, np(1 — p)), where n, p satisfy the conditions 
needed for Y to be used as an approximation for X. Write down the 
probability you need to calculate for Y (including the continuity correction) 
as the approximation for each of the following probabilities for X. 
a) P(X<42) _b) P(X>31) ce): P(X>9) d) P(42<X<85). 
2. Redraw as histograms, using the continuity correction, the probability diagrams 
you drew for questions 1 and 2 of Exercise 10.1 (i.e. no gaps between bars). 


10.3 The parameters for the normal approximation 


In Section 7.2 we saw that if X ~ B(n, p), then E(X) = np and Var (X) = npg = np(1-p). 


If n is large and p is close to 0.5, so that the distribution is nearly symmetric, 
then you can use the normal distribution to approximate the binomial. 


The parameters to be used are the mean and variance of the binomial: 4 = np; 0° = npq. 


The normal distribution is symmetric, and so is the binomial when p = 0.5, but as n gets 
larger the requirement for p to be close to 0.5 becomes less important. 


The general rule is that the normal distribution can be used as an 
approximation when both mp and nq are > 5. 


The normal approximation to the binomial distribution 


Example 2 
If X ~ B(30, 0.4), calculate P(12 < X < 15) by 


a) calculating binomial probabilities  b) using a normal approximation. 


a) P(I2<X< 15) =P(X=12, 13, 14, 15) 


30 30 a (ey 30 
= 0.4°0.6"% + 0.4°0.6" + 0.40.6" + 0.4°0.6" 
12 13 14 15 


= 0.147375 + 0.136039 + 0.110127 + 0.078312 
= 0.471853 = 0.472 (3s.f.) 

b) For the binomial, 
= np = 30x 0.4=12; o° =npq = 30x 0.4 x 0.6 =7.2, 
so use the N(12, 7.2) distribution to approximate the 
B(30, 0.4) distribution. 
‘The continuity correction says 
P(12 < X< 15) = P(11.5 < Y< 15.5) where Y is the 
approximating normal. 


P(11.5< Y< 15.5) = o( 


= (1.304) — @(-0.186) = 0.9032 —(1-0.5738) = 0.477 (3 s.f.) 


Example 3 

An airline estimates that 7% of passengers who book seats on a flight are ‘no shows’ - for one 
reason or another they miss the flight. On one flight, for which there are 185 available seats, 
the airline sells 197 tickets. 


Find the probability that the airline will not have to refuse boarding to any passengers holding a 
valid ticket for that flight. 


The number of ‘no shows, X, can be modelled by a B(197, 0.07) distribution if we make 

the simplifying assumption that all passengers behave independently of one another (almost 
certainly not entirely the case, but not wildly wrong in most cases — if there were a large group 
travelling together it would not be a good assumption). 

We want P(X > 12). 

Use Y ~ N(197 x 0.07, 197 x 0.07 x 0.93) = N(13.79, 12.8247) to approximate, 

and calculate P(Y > 11.5) using the continuity correction: 


P(Y> 11.5) =P (z-Naee 


12.8287 


} = ®(-0.639) = 0.7386 = 0.739 (3 s.f.). 


The parameters for the normal approximation 


Exercise 10.3 


1. 


Which of the following could reasonably be approximated by a normal distribution? 
For those which can, give the normal which would be used. 


a) X~B(50, 0.7) b) X~B(10, 0.7) c) X~B(500,0.2)  d) X~B(500, 0.002). 
Use normal approximations to calculate 
a) P(X < 42) if X ~ B(50, 0.7) b) P(X>9) if X ~ B(40, 0.3) 
©) P(X > 13) if X ~ B(34, 0.37) d) P(25 <X<37) if X ~ B(80, 0.4) 
e) P(43<X<55) if X ~ B(124, 0.43). 
For X ~ B(50, 0.3) 
a) calculate P(12 < X < 20) 
i) using tables ii) using a normal approximation. 
b) i) Whaterror is there in using the normal approximation? 
ii) Express part (b) (i) as a percentage of the exact probability. 
Explain briefly the conditions under which the normal distribution can be used as an 


approximation to a binomial distribution. If the conditions are satisfied, state what 
normal distribution would be used to approximate the X ~ B(n, p) distribution. 


On a production line, on average 6% of the bottles of lemonade are not filled properly. 

a) If five bottles are examined, find the probability that exactly one of them is not 
filled properly. 

b) If 2000 bottles are examined, use a normal approximation to find the probability 
that less than 100 of them are not filled properly. 


A multiple choice test has 50 questions each with four possible answers. 


a) If Catharine guesses the answer to each question randomly, state the exact distribution 
of X, the number of answers Catharine gets correct. 

b) Ifthe test has a pass mark of 20, find the probability that Catharine passes the test 
using a suitable approximation. 

c) Shopna takes the same test, but knows enough to be able to rule out one of the possible 
answers to each question. Use a normal approximation to find the probability that 
Shopna fails the test. 


A Health Trust gives guidance to doctors that they should show any patients with raised 
blood pressure some information on lifestyle which would help bring down their blood 
pressure. 

The doctors in one surgery ask their research assistant to look at the patient records and 

find that 35% of their patients have raised blood pressure. 

a) The surgery has 84 patients booked for appointments on the following day. 
Assuming that these 84 patients are a random sample from all the patients registered 
with that surgery, find the probability that at least 25 will have raised blood pressure. 

b) Comment on the assumption made in part (a) that the people with appointments at the 
surgery on the following day will be a random sample from all people on the register. 


The normal approximation to the binomial distribution 


Summary exercise 10 


b) Some weeks later the probability of 
success has increased to 0.53. Using 
a suitable approximation, what is the 
probability of 120 or more successes in 
250 drives? 


1. Amil is a dentist who finds that, on average, 
one in five of his patients do not turn up for 
their appointment. 

a) Using a binomial distribution find 
the mean and variance of the number 
of patients who turn up for their 
appointment in a clinic where there were 
20 appointments. 

b) Find the probability that more than 
two patients do not turn up for their 
appointment in the clinic in part (a). 


c) A year later the probability of success has 
increased to 0.92. What is the probability 
of three or fewer failures in 50 drives? 


Among the blood cells of a certain animal 
species, the proportion of cells which are of 


type O is > and the proportion of cells which 


c) Using a suitable approximation find . 
are of type AB is 0.005. 


the probability he sees more than 135 
patients in a week where he has 154 
appointments. 


a) Find the probability that in a random 
sample of eight blood cells at least two 
will be of type O. 


b) Using suitable approximations, find the 
probability that in a random sample of 
200 blood cells the total number of type 
O and type AB cells is at least 81. 


2. A fair coin is tossed repeatedly. Using 
suitable approximations where appropriate, 
find the probability that you would see 
a) more than 7 heads in 10 tosses 
b) more than 70 heads in 100 tosses 


: 6. A tour operator organises a visit for cricket 
c) more than 700 heads in 1000 tosses. 


enthusiasts to India in November. 

3. A fair die is thrown repeatedly. Using 
suitable approximations where appropriate, 
find the probability that you would see 


a) more than 3 sixes in 12 throws 


The package includes a ticket for a one-day 
international in Nagpur. Places on the tour 
must be booked three months in advance. 
From past experience, the tour operator 
knows that the probability of a person who 
has booked a place subsequently withdrawing 
is 0.1 and is independent of other withdrawals. 
a) Twenty-five people book places. Find the 
probability that 


i) none withdraw 


b) more than 30 sixes in 120 throws 


c) more than 300 sixes in 1200 throws. 


EXAM-STYLE QUESTIONS 
4. A golfer practises on a driving range. His 
objective is to hit a ball to within 10m ofa flag. 


a) On his first visit the probability of success 
with each particular ball is 0.3. If he hits 
ten balls what is the probability of 


i) four or fewer successes 


ii) two or more withdraw. 

The tour operator only has 21 tickets 

available for the one-day international. 

iii) What is the probability that he will be 
able to provide everyone who goes on 
tour with a ticket? 


ii) four or more successes? 


Summary exercise 10 


b) An organiser of a similar but larger tour 
accepts 250 bookings but has only 210 
tickets for the one-day international. 
Find, using a suitable approximation, 
the probability that this organiser will 
be able to provide everyone on this 
tour with a ticket. (Assume that the 
probability of a person withdrawing 
remains at 0.1.) 


Use a suitable approximation to find the 
probability that more than 250 germinate. 


9. It is known that 45% of adults in a certain 
country are overweight. A random sample of 
350 adults is chosen. 


Use a suitable approximation to find the 
probability that fewer than 150 in the sample 
are overweight. 


10. The mass of eggs may be modelled by a 
normal distribution with mean 55 grams and 
standard deviation 5.2 grams. 


7. Onacertain road 20% of the vehicles are 
trucks, 65% are cars and the rest are buses. 
i) A random sample of 12 vehicles is taken. 
Find the probability that fewer than 3 are 
trucks. 


A producer classifies an egg as jumbo if it has 

a mass of at least 65 grams. 

i) Find the probability that a randomly 
chosen egg is classified as jumbo. 

ii) A box is filled with 12 randomly selected 
eggs. Find the probability that there are 
no jumbo eggs in the box. 


ii) A random sample of 150 vehicles is now 
taken. Using a suitable approximation, 
find the probability that more than 110 
are cars. 


8. When a lima bean seed is sown in suitable 
conditions it is known that on average 
80% germinate, and they germinate 
independently of one another. A market 
gardener sows 300 of these seeds under 
suitable conditions. 


iii) A large order for 300 eggs comes 
in from a restaurant. Use a suitable 
approximation to find the probability 
that there are fewer than 6 jumbo eggs in 
the order. 


Chapter summary 

e The binomial distribution B(n, p) can be approximated by the normal distribution N(np, npq) 
provided both np and ng = n(1—p) are greater than 5. 

e When the normal distribution is used to approximate the binomial (or any other distribution 
that takes only integer values), a continuity correction must be used. 


The normal approximation to the binomial distribution 


Maths in real-life 


Statistics is definitely not a lonely world 


Most statisticians work in interdisciplinary 
teams — they may work on environmental 
issues, in business and finance, in 
manufacturing, in health, investigating 
effective ways of tackling poverty or in almost 
any interesting human activity - working in 
conjunction with specialists in the area. 


One of the career growth areas in recent years is 
simulation modelling. Computers now offer such 
immense power that complex simulations can be run on even standard 
desktop or laptop computers. Many interesting situations like the spread of 
disease and financial investment risk analysis can be modelled by relatively 
simple steps repeated many times. With the advent of air travel and the 
ability for disease to spread rapidly, these computer models are extremely 
important. 


Many diseases follow the basic SIR model - the population is made up of 
three groups: Susceptible, Infected and Recovered (the recovered group can 
be thought of as including anyone who is immune for any other reason, 
such as vaccination). Each disease will have varying characteristics - 
how long someone is infected for, how likely they are to pass it on to a 
person they come in contact with, etc. A simulation can build these in as 
arameters, so for a disease where a person is infectious for 3 days, and the 
probability of infecting a person on any given day is 0.2, the chance of not 
infecting them in 3 days will be 0.8° = 0.512 
or roughly evens. However, if you want to J ~] 


model the spread in a population you have to ' | 
i per ¥ RY cootistiitine sore’ (S)NO ENTRY cots 22a sone 
s YATVS OVC) entre eromucn oun 17s 


| 


take account of multiple interactions over the 


eriod of time the disease persists. 


Setting up a computer model of the disease 
behaviour and running it repeatedly allows 
ealth professionals to explore different 
possible courses of action they may take to 
intervene in trying to control epidemics. 


Maths in real-life 


Being able to model the patterns of disease 4 
Height [45 
Width [45 
Infectious Period [4 
= Susceptibility [0.125 


is extremely important. For example, Foot 
and Mouth disease is a highly infectious viral 
disease affecting cloven-hoofed animals which 


caused an epidemic in the UK in 2001. The uninfected 

disease is extremely hard to contain and there milicectad 
ae HE Immune 

were 2000 cases in farms across the country. 

Some infected animals had been transported Run Again 


to the continent before the outbreak was 


Click on square to infect Days 


identified and measures were taken to prevent 
the epidemic becoming continent wide — for 
example in Holland they slaughtered a quarter 
of a million cattle as a precaution. 


The economic implications of finding effective 
ways of containing virulent diseases like this 
are huge. How would a government manage if 


Population [400 _] 
Proximity for infection [15 


Area [480 
there was an outbreak of a disease with these Infectious Period [70 
Fatal Period [150 


characteristics amongst humans? 


Immune Population [50 


A number of simulations modelling the spread of 


6 ¢ Days 18 
disease can be found on the website, http://www. pected Infected 142 
oxfordsecondary.com/cambridge-alevelmath. CREED mune 50 

Click on A Circle To infect it Dead 0 


These will give you some insights into how 
a model is developed - introducing greater 
complexity at each stage, so it starts with a static 
model, where infected people have fixed contact points with other people. 


‘The first screen shot on the right shows the model where the user can 
select ‘immune’ squares (shown in purple at the centre bottom of the 
grid), which can be thought of as representing barriers - such as a 

river, or a mountain range — that the disease cannot pass over but has 

to move around. In this instance almost the entire population was 

infected (subsequently becoming immune), with only a small number (the 
light blue cells) having escaped by chance, except for the block shielded by 
the barrier. 


The second screen shot shows the final model in the series. The population 
is moving around and there are more parameters to control — allowing you 
to explore what happens with different combinations of effects. ‘People’ 
who get infected in this model either die (shown as skull) or recover 
(shown as a light pink circle; when they recover they are immune from 
reinfection). 


Statistics is definitely not a lonely world 


Exam-style paper A 50 Marks 


1. The lengths, x cm, of 25 rulers were measured, and are summarised by ye — 30) = 0.6 
and }) (x - 30)? = 5.32. 


Find the mean and standard deviation of the rulers. [3] 
2. The distances run in training by a group of marathon runners are summarised in the 
following table. 
Distance run (x km) | Frequency 

15<x<25 | 19 
25 <x<30 | 7 
30 <x <35 | 24 
35<x<40 12 

Draw a histogram on graph paper to represent this data. [4] 


3. Lengths of climbing rope are normally distributed with mean 24.2 metres, 
and 20% of the ropes are longer than 25 metres. 


a) Find the standard deviation of the lengths of the ropes. [3] 
b) Find the probability that a randomly selected rope is shorter than 24 metres. [3] 


4. A nurse running a clinic knows that, on average, 22% of her patients do 
not turn up for appointments. 
a) Find the probability that, in a clinic with 10 appointments, fewer 
than 7 patients turn up. [4] 
b) For five randomly selected clinics with 10 appointments, find the probability she 
has at least one clinic with fewer than 7 patients. [3] 


5. A doctor specialises in sports injuries. The 134 injuries he treats in some sports over 
a month are summarised in the following table. 


Knee | Shoulder | Other 

Basketball 23 10 7 40 

Tennis 12 14 16 42 

Soccer 22 6 24 52 

57 30 47 

A patient is chosen at random from this group. What is the probability that the patient 
a) suffered a knee injury playing tennis [2] 
b) suffered a shoulder injury [2] 
c) was playing soccer, given that they suffered an injury classified as other? [3] 


Exam-style paper A 


6. a) A group of 12 people consists of 3 boys, 5 girls and 4 adults. In how many ways can 
a team of 4 be chosen if 


i) at least two boys are in the team 
ii) the adults are either all in the team or all not in the team 
iii) there must be at least one of each of the boys, girls and adults? 


b) Find the number of distinct permutations of the letters of the word ASSESS. 


7. A-spinner has 5 equal sections numbered 1, 2, 3, 4, 5. 
a) Find the probability of obtaining at least 6 odd numbers in 8 spins. 


‘The spinner is spun twice. Let X be the product of the two scores. The following table 
shows the possible values of X. 


xfil2[3]4]5 
1 ae 3 | 4 

2/2[4]6 [8 | 10 
3 | 3 [6] 9 | 12 | 15 
4 ats 12 | 16 | 20 
5 | 5 | 10 | 15 | 20 | 25 


b) Draw up a table showing the probability distribution of X. 
¢) Calculate the mean and variance of X. 


d) Find the probability that a randomly chosen observation of X is greater than 
the mean of X. 
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[2] 
[2] 
[3] 
[3] 


[4] 


[3] 
[4] 


[2] 
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Exam-style paper B 50 Marks 


1. It is given that X ~ N(46.3, 12.3). Find the probability that a randomly chosen 
value of x lies between 40 and 50. [3] 


2. The following back-to-back stem-and-leaf diagram shows the pulse rates of a group 
of students before and after a short period of moderate exercise. 


Before exercise After exercise 
(3) 9 7 3/6 
6) 9 8 6 3 1 o|]7}9 (1) 
(5) 8 6 4 2 1/8|/2 2 3 45 F OW” 
(1) 2}9/2 5 6 8 9 (5) 
10} 2 5 (2) 
Key 2 | 8 | 3 means 82 beats per minute before exercise 
and 83 beats per minute after exercise 
a) Find the median and quartiles of the pulse rates before exercise. (2] 
You are given that the median pulse rate after exercise is 89, the lower 
quartile is 83 and the upper quartile is 98. 
b) Represent the data by means of a pair of box-and-whisker plots in 
a single diagram. [3] 


3. The following table shows the results of a survey to find the average daily 
time, in minutes, that a sample of children spent exercising on the previous day. 


Time (t minutes) Frequency 
Ost<10 3 
10<t<20 5 
20<t<40 f 
40<t<70 4 
70 <t< 120 3 


The mean time was calculated to be 35.8 minutes. 


a) Form an equation involving f and hence show that the total number of 
children was 25. 4 


b) Find the standard deviation of the times. 


4, It is known that, on average, 15% of people in a certain country suffer 
from malnutrition. 
a) Find the probability that in a random sample of 10 people there is not 
more than one person suffering from malnutrition. 3 
b) Another randomly selected sample of 400 people is now chosen. 
Find the probability that not more than 40 people are suffering 
from malnutrition. 4 


skye Exam-style paper B 


5. A player has a probability of 0.4 of winning a tennis tournament, and 
a probability of 0.2 of not reaching the semi-final stage. If he wins the 
tournament, the corresponding probabilities for the next tournament 
are 0.5 and 0.1. If he reaches the semi-final but does not win the first 
tournament the corresponding probabilities are 0.5 and 0.3. 
If he fails to reach the semi-final the corresponding probabilities are 0.2 and 0.6. 


a) Find the probability that he does not reach the semi-final stage in the first 
tournament and wins the second tournament. 


b) Find the probability that he wins at least one tournament. 
c) Given that he wins the second tournament, what is the probability that 
he also won the first tournament? 3 
6. The random variable X has probability function P(X = x) = kx’, x= 1, 2,3, 4,5. 
a) Show that k = = 2 
55 


b) Find the probability P(X < 3). 
c) Calculate E(X). 
d) Find the probability that four randomly chosen observations of x contain 
exactly one which is greater than E(X). 4 
7. a) Four friends attend different schools, and each brings two classmates to 
a quiz evening. The quiz is for teams of 6, so the group make up two teams. 
i) How many different pairs of teams are possible if students at the same 
school are always on the same team? 2 
ii) How many different pairs of teams are possible if the four friends are 
all on one team? 2 
iii) How many different pairs of teams are possible if each team includes 
at least one student from each school? 3 
b) Find the number of distinct permutations of the letters of the word 
DIFFERENCE. 3 
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Answers 


‘The answers given here are concise. However, when 
answering exam-style questions, you should show as many 
steps in your working as possible. 


1 


Introduction to statistical 
thinking 


Exercise 1.1 page 3 


1. 


3. 


Long-term form plays an important part in an 
extended league format. However, a ‘surprise’ result in 
a knock-out tournament can be caused by a temporary 
lack of form or from a team playing above their 
normal standard. 

Either collect or use previously recorded data to 
estimate the travel times for the journeys involved. 
‘There is extensive research carried out by bodies like 
the World Health Organisation (WHO) into this 
question. Unfortunately much of this information 

is often inconclusive and the outcomes can also be 
confused by reports from interested parties such as the 
mobile phone providers. 


Exercise 1.3 page 10 


L 


2 


3. a) 


Days late 0 


Frequency | 19 


Depth 
(feet) 


Frequency | 3 4 4 4 8 6 


Make of car (qualitative, observed) 

Colour of car (qualitative, observed) 

Year of registration (quantitative, discrete, may be 
determined from the number plate) 

Mileage (quantitative, continuous, read from 
milometer) 


b) Length of race (quantitative, continuous, 
measured or stated in race information) 
Number of entries (quantitative, discrete, counted 
or determined from the entry list) 
Winning time (quantitative, continuous, 
measured) 
Make of winning bike (qualitative, observed) 


) Number of bulbs (quantitative, discrete, counted) 
Size (quantitative, continuous, measured) 
Colour (qualitative, observed) 
Make (qualitative, observed) 


Answers 


d) 


e) 


8) 


h) 


k) 


1) 


Name (qualitative, observed) 
Location of source (qualitative, observed) 
Length (quantitative, continuous, measured) 
Rate of flow at various points (quantitative, 
continuous, measured) 


Number of sheep (quantitative, discrete, counted) 
Colour (qualitative, observed) 

Place of origin (qualitative, observed) 

Weight (quantitative, continuous, measured) 


Number produced per day (quantitative, discrete, 
counted) 

‘Type of chocolate (qualitative, observed) 

Name (qualitative, observed) 

Weight (quantitative, continuous, measured) 


Consistency (e.g. slushy) (qualitative, observed) 

Air temperature (quantitative, continuous, measured) 
Depth (quantitative, continuous, measured) 
Duration of snowstorm (quantitative, continuous, 
measured) 


Name (qualitative, observed) 

Height (quantitative, continuous, measured) 
Team (qualitative, observed) 

Number of home runs scored (quantitative, discrete, 
counted) 


Gender (qualitative, observed) 

Age (quantitative, continuous, measured) 
Height (quantitative, continuous, measured) 
Loudness of roar in decibels (quantitative, 
continuous, measured) 


Year of first opening to the public (quantitative, 
discrete, research historical records) 

Number of visitors per day (quantitative, discrete, 
counted) 

Age (quantitative, continuous, measured) 

Height (quantitative, continuous, measured) 


‘Types of business (qualitative, observed) 

Number of business (quantitative, discrete, counted) 
Height of tallest building (quantitative, continuous, 
measured) 

Annual turnover (quantitative, discrete, counted) 


‘Type of vehicles (qualitative, observed) 
Number of vehicles (quantitative, discrete, counted) 
Number of traffic police (quantitative, discrete, 
counted) 

Time taken to travel one mile (quantitative, 
continuous, measured) 


2 Measures of location and 


spread 
Skills check page 13 
1. a) 2 b) 2 3 5 d) 2 
2. a) 68(1dp.) b) 65 c) 2 
d) 6 


Exercise 2.1 page 16 
Ty va} 27 b) 76.5 ©) 77 
2 a) 4 b) 4 <) 41(1dp.) 
3. 90.8 beats per minute (1 d.p.), 109.1 beats per minute 
(1 dp.) 
62.8cm (1 d.p.) 
5. 21.1 minutes 


6. $16600 

Exercise 2.2 page 19 

lL. Q.=75,Q=77 

% @=3,Q=5 

3. a) 64beatsper minute b) 8.5 beats per minute 

4, a) 28 b) 1 

5. a) 49.5 seconds b) 58 seconds 

6. a) 19.5 minutes b) 18 minutes 

Exercise 2.3 page 24 

1. a) mean = 13.8 (3s.f.); standard deviation = 3.86 
(3 s.f.) 

b) mean = 6.38 (3 s.f.); standard deviation = 1.37 

(3 s.f.) 


2. mean = 74.3cm (3 s.f.); standard deviation = 1.99cm 
(3 s.f.) 

3. mean = 346 g (3 s.f.); standard deviation = 11.5g 
(3 s.f.) 


4, mean = 2.5; variance = 3.52 (3 s.f.) 

5. mean = 0.3; standard deviation = 3.43 (3 s.f.) 
6. 2.77 (3s) 

7. 23.1(3s£) 

8. Standard deviation is approximately 10. 
Exercise 2.4 page 28 


1, mean = 80.5, variance = 132.5 

2. mean = 1137, variance = 322 

3. a) 55.85°F —b) 
)  13.25°C 


10, 12, 11.5, 13, 13, 14, 15.5, 17 


a) mean = 11.2 (3s.f.), variance = 2.64 (3 s.f.) 
b) X=0,1,2,3 

c) mean = 1.08 (35.f.), variance = 0.660 (3 s.f.) 
a) X=0,3,5,7,9, 14 

b) mean = 4.46 (3 s.f.), variance = 7.48 (3 s.f.) 


c) mean = 16100 (3s.f.), 
variance = 46700000 (3 s.f.) 


a) T=0,1,2,3 
b) mean = 1.15 (3s.f), variance = 0.873 (3 s.f.) 
c) mean = 18.2 (3s.f.), variance = 21.8 (3 s.f.) 


Summary exercise 2 page 30 


1. 


2. 
3. 
4. 


as 


10. 


11. 


3 


6; 25.4 

a-1.85; 8.55 

799; 7188 

a) mean = 65.6 (3s.f.), variance = 31.9 (3 s.f.) 
b) 61523.646 c) 64.7 (3 sf.) 


median = 8, interquartile range = 9.5 

a) mean = 67.6, standard deviation = 6.25 (3 s.f.) 
b) a= 36.35, b=0.625 
a=10.11; 6 =3.04 (3 sf.) 
¥ =70.3; 6 =2.72 (3 s.f.) 
Dx=977; Dx’ =38507 
i) mean height = 170cm 
ii) st. dev. =5.94cm 


4-year-olds: mean = 99, standard deviation = 9.6 


14-year-olds: mean = 77.1, standard deviation = 7.6 


‘The pulses for 14-year-old children are considerably 
lower on average and they are more consistent. 


Representing and analysing 
data 


Skills check page 33 


1. 


a) median = 12 lower quartile = 9 
upper quartile = 19 

b) median=4 lower quartile = 4 
upper quartile = 5 


Exercise 3.1 page 37 


1. 


a) ‘Type A: median = 44; interquartile range = 8 g 
‘Type B: median = 51 g; interquartile range = 6 g 


Answers 


) Type A Type B Exercise 3.2 page 40 


5} 2 1. Onaverage resort B is hotter than resort A. 
meorllas ‘Temperatures are more variable in resort A than in 
resort B. It would also be useful to have information 
97755 | 3 about the temperature at different times of the year, 
444422221 | 4 |4 hours of sunshine and the amount of rainfall. 
8777766655 | 4 |56778889999 2. a) Onaverage males are paid more than females. 
2210 | 5 |00001111122334 ‘The overall range is similar for males and females 


but the very low paid males are outliers. The 


987 | 5 |556677899 interquartile range is less for males so their 


Key: 5 | 4 | 6 means 45g for Type A and 46 for salaries are less variable than the salaries of 
Type B females. 

©) On average Type B plums are heavier than b) The average starting salary is similar for males 
Type A plums. Type A plums are more variable in and females with the females’ salaries varying 
weight than Type B plums. more than the males. It may be that the males 


stay with the college longer which could account 


d) ‘Type B because they are heavier and less variable, a 
for their average salaries being higher in 


2. Before exercise: median = 81 beats/minute, 


interquartile range = 20 beats/minute Meda 

After exercise: median = 94 beats/minute, 3. a) 

interquartile range = 13 beats/minute 2010 HT x x x 

Pulse rates were higher on average and less variable 

after exercise than before. 2015 HT HK - ” 
3. January: median = 22.55 g, interquartile range = 2.25g 

April: median = 21.5 g, interquartile range = 1.9g 9" 20 40 60 80 100120 140 160 

On average the Dunnocks are heavier and their Number of employees 


weights are more variable in January than in April. b) ‘The number of employees is higher on average 


4: Males: median=11695mm, and more variable in 2010 than in 2015. 
interquartile range = 13mm 


Females: median = 179.5mm, 4. a) 
interquartile range = 16mm 
On average the females have a longer wingspan than 


Min | LQ |Median| UQ | Max 


[ 30 | 126 | 147_| 161 | 280 | 
85 99 148 | 148 


the males and females have a more variable wingspan Vegetables 
thenthe males: Seafood | 34 | 34 | 84 | 84 | 84 | 
5. Fruit Vegetables 
b) The 5 summary statistics are equal. 
3] O | 222344555678 ) 
© 
9953221 1 18 Fruit 
66310) 2 |6 
eal veesanis| Hi] 
Key: 3 | 2 | 6 means 23 for Fruit and 26 g for Od ao \ogaenenoaaoSad 
Vegetables Serving size 
Fruit: median = 19g, interquartile range = 14g 
Vegetables: median = 5g, interquartile range = 5g 5. Onaverage the highest scores were in the long jump 
‘The amount of carbohydrate in fruit is larger on and the lowest scores were in the 1500 metres. The 
average and more variable than in vegetables. overall range was highest for the 1500 metres and 


lowest for the 100 metres. The interquartile range 
was highest for the long jump and lowest for the 

100 metres. ‘There was less variation in the scores for 
the 100 metres than the other two events. 


Answers 


Frequency density 


2. 
0.018 
0.016 
0.014 
0.012 
0.010 
0.008 
0.006 
0.004 
0.002 
> 


10000 20000 30000 40000 50000 
Salary ($) 


Exercise 3.3 page 43 5. a) 
i 0.10 
3 0.09 
23 0.08 
zB 2 
52 2 0.07 
8 & 
B1 S 0.06 
— S 0.05 
9" 20 40 60 80 100120 E oot 
Length em), A 0.03 
2. a) = 002 
x 5 0.01 
z4 
23 0" 2 4 6 8 10 12 14 16 18 20 
: 5 Time (minutes) 
a 1 b) 10 
tv) . 
AD: 20:80 FAO 1801168) 70 Exercise 3.4 page 47 
Protein (grams) 
1. a) median = 64, lower quartile = 60.5, 
b) mean = 34.2g (1 d.p.), upper quartile = 69 
standard deviation = 14.7g (1 d.p.) ‘ 
c) Onaverage the daily protein intake was higher ) Pulse rate Frequency 
in the second country than in the first. The daily a 
protein intake was more variable in the first 51-55 4 
country than in the second. -———————_J 
3 56-60 5 
8 61-65 12 
ve 
G6 66-70 8 
2 
35 
24 71-75 4 
S 
& 3 76-80 2 
2 
1 81-85 1 
0" 40 20 30 40 50 60 70 80 c) median = 64.5 (1 dp), lower quartile = 60.7 
Time (minutes) (1 d.p.), upper quartile = 69.6 (1 d.p.) 
4. a) median = 28, lower quartile = 23, 


upper quartile = 34 


b) 
Age Frequency 
17-20 rg 
21-24 6 
25-28 12 
29-32 = 
33-48 13 


Answers 


189 


190 


4, 


) Exercise 3.5 page 51 
= 1. a) A=169(2dp.), B=-1.62 (2dp.), 
C=-1.04 (2 dp. 
40 (2d-p.) 
5 b) Group A 
= 30 2. a) A=-0.07(2d.p.), B=0.17 (2d.p.),C=-0.21 
2 b) Group C 
g 20 c) Yes, but the signs are reversed for Groups B and C 
2 3. a) 31.9375 
10 
b) 2 | 566777899 
O10 20 -30:«40—s50 2 || 228 
Age 4) 25 
d) median = 28, lower quartile = 24, 5 | 2 
tile = 35 
) MEPES _ as Key: 3 | 4 means 34 
a 
2504 c) median = 29, lower quartile = 27, 
upper quartile = 33.5 
560 d) 45 and 51 are outliers 
s | e) 
F Hit. - 
£ 1504 
2 20 25 30 35 40 45 50 55 
z 1004 Age 
3 
f) The distribution is positively skewed because 
504 there is a long tail to the right. 
Also, Q, - Q,>Q,- Q, 
4, a) median = 49.5 seconds, 
. 10000 20000 30000 40000 50000 interquartile range = 58 seconds 
asia ” Hi 
x 
b) median = $13 700, lower quartile = $11 400, ! 
upper quartile = $17 000 rrr ar 
3 4% 0 20 40 60 80 100 120 140 160 180 
a) A Time (seconds) 

50 c) The distribution is positively skewed because 
there is a long tail to the right. However, the 
distribution is almost symmetrical between the 

40 ‘ 

> quartiles. 

2 

2 d) A zero waiting time occurs when there is a train 

£ 30 at the platform and the passenger does not have 

£ to wait. 

& 

=z 20 | 

é Exercise 3.6 page 54 

10 1. Onaverage their pulse rates were higher after the 
PE lesson than after the mathematics lesson. They 
also varied more after the PE lesson than after the 
r r , > ; 
oF ey, 6 wm abies ayn mathematics lesson. 
Time (minutes) 
b) 7.3 minutes c) 6 minutes 
Answers 


2. a) mean = 1.1kg, variance = 0.075 kg e) The mean would decrease because the exit 


b) On average the new diet shows a higher weight times for the 12 flights being excluded are 
loss than the traditional diet. The weight loss with all bigger than the current mean of 
the new diet is less varied than the weight loss 17.9 minutes. 
with the traditional diet. 4, i) * 


3. On average the plants are longer in garden A than 
in garden B. The overall range is slightly bigger for 
garden B than for garden A but the interquartile range 
is bigger for garden A than for garden B. This means 
that there is more variation in the central 50% of the 
data for garden A than for garden B. 


n 


Frequency density (per second) 


Summary exercise 3 page 55 4 zoe > 


1. a) 
|——. Time (seconds) 


ii) 32 seconds iii) 5 seconds 


0 10 20 30 40 50 5. i) median = 0.41 seconds, LQ = 0.325 seconds; 
Time delay (minutes) UQ= 0.52 seconds 


b) The distribution of delays is positively skewed 
because there is a long tail to the right. Also, 
Q-272,-Q a 

BCs computers 

c) On average there were longer delays on the 

second day than on the first. The delays on the 


second day were more varied than the delays on Tablets i ___ oo 
the first day. 
2. 0 | 013346788 — ror or 
1 | 12245 0 01 02 03 04 05 06 07 08 09 1 
Time (seconds) 
2/8 
iii) Desktop computers are quicker (on average), and 
2| 4 the two types have similar variability. 
Key: 1 | 6 means 16 passengers 6. a) 0 | 4567777 
3. a) The data is continuous and has unequal class 1 | 2388 
widths. ale 
b) upper class boundary = 14.5, lower class 
3 | 334 
boundary = 9.5 
4) 4 
©) 
6 5 |6 
Bs 6/9 
2 
$4 
> Key: 2 | 4 means 24 
$ id b) 5 | 578 
& 2 
c 6|3 
1 
7 | 
0" 10 20 30 40 50 8 | 479 
Time (minutes) 9 | 0333559 
d) mean = 17.9 minutes (1 d-p.), Key:8 | 5 means 85% 


standard deviation = 12.2 minutes (1 d.p.) 


Answers 


9 KTH 4 Probability 


Skills check page 62 
50 55 60 65 70 75 80 85 i. a) pen b) pf 
Life expectancy 12 6 


‘The overall distribution is negatively skewed but 2. HH, TT, HT.TH 
it is almost symmetrical between the quartiles, 


Fi Exercise 4.1 page 64 
1, a) A={l,2,4}, B={I, 4}, C= {2, 3, 5}, D= {3, 6} 


liid 
= ot oe bd) pppz 9 (2h d) 
0 — 10000 20000 30000 40000 50000 60000 

GNI (US $) 2 0 f) ; 


nS 
6 


Positive skew. 


I 2. a) Aconsonant is chosen. 
b) A-={A,E, I}, B= {B}, C={A, B,C, D, E, G, I, M}, 


o 2 8 a & Ss © & D={GE} 
$ € ese 8 e 8 
s8 8 8 8 s s 8 1 1:82 
s =& & & & & ¢ ) ppes ) AED 
$8 & 
Population (thousands) °) i f) 1 2) : 
Very positively skewed. 
7. A 3. ©) 10 
4. b) 5 d) 3 
g Exercise 4.2 page 66 
id 1. {(VV); (VO); (ViC); (O,V); (0,0); (O,C); (C,V)s 
A (C0) (C.C}} 
z 2. {(V,O); (VC); (O,V); (O,C); (C,V); (C,O)} 
3 (sum [1 ]2 13 14 15 |6 
i els las le iz 
2 4 6 8 10 12 14 16 18 20 22 2_[3 [x [s [6 |7 [s 
Time (minutes) 3 [4 [5s [x [7 [s |o 
4 |s [6 [7 [x [9 [10 
8 100m HLOoH 
5s [6 |7 [s [9 [x [nu 
Long jump His 6 [7 [s fo [io [nu [x 


= : t r T t : 
600 700 800 900 1000 1100 1200 4. a) {(H, 1); (HL 2); (H, 3); (HL 4); (HL 5); (HL 6); (T, 1)s 
Points (T, 2); (T 3); (T 4); (T 5); (T, 6)} 
On average the points scored in the long jump are b) 
highest and the points scored in the shot are least. score [1 ]2[3 [4 [5 [6 | 
‘The variation in the points scored is greatest in the H 1[2[3]4]5]o [7] 
long jump and lowest in the shot. t 2([3l4[slel7[s | 
iceme Answers 


5. 1 1 1 
*) [product [1 [213 5 [6 4a) 5 b) 5 9 i 
1 1 {2 [3 [4 [5 [eo e092 ot wo & 
9 2 [4 |6 10 [12 
3 3 [eo [o |i2 [is | is Exercise 4.4 page 72 
4 4 [8 [12 [16 | 20 | 24 1, a) 02 die b) 0.02 
5 5 | 10 | 15 | 20 | 25 | 30 
6 6 [12 [18 [24 [30 | 36 ae tat 
i at le hk jaf, aL, : 0.8—~ survive 
) ig ii) js ii) 5 iv) ig 
die 
6. a) no os 
tow [1 ][2 [3 ]4a [5 [o 0.05 seat 
belt 
1 a fa [a [fo [@ fa en > 
2 1 [2 [2 [2 |2 ]2 ; survive 
3 1]2 [3 [3 [3 [3 2% a) 590) OF 
4 i|2 [3 [4 [a fa 3. Student A 
5 1 [2 [3 [4 [5 Js se oma 
6 1 [2 [3 [4 [s |e : 
7 1 1 
b) id 36 ii) 12 iii) 36 
7. a) 0.4 no homework 
Difference 2 [3 [4 [s lo 
1 o fi [2 [3 [4 [5 
Student B 
2 1 fo Ji [2 [3 ]4 
3 2/1 1/0 Ji /2 | 3 09 homework = — 0.72 
4 3 [2 0 2 
diary 
2 0.8 
5 18 uw On no homework 0.08 
6 5 {4 [3 [2 [1 
i 1 * 05 homework 01 
» » 2 i 4 itt) 0 Ne 
diary. 
Exercise 4.3 page 70 
15 pag 15 8 no homework = 0.1 
a) >) os a) 0.82 b) 0.492 ©) 0.752 
2. a) 04 b) 05 a9 2 yp® 9 & w 
3. 0.03 fired 271 542 260 364 
25 7 
new 5 a) TF) og 
0.06 job 
0.97 not fired 
fired 
0.94 not a3 . 
new 
job 


0.97 not fired 
a) 0.9118 b) 0.0864 ¢) 0.0018 


Answers 


late Summary exercise 4 page 79 


6. 1 
4 1. a) & purple 
1 bus 3 not 8 ‘ 
a Z late To Pale 5 
= pink 
‘i 1 late js 
os 5 & urple 
3 4 2 9 a 
bo ae pink 
4 not 7 
5 late P 
o ink 
7 é 9 p 
ry To late 4 a 
car 5 99 
a9 a) 0.002978 b) 0.671 (3 sf) 
20) 2, ¢) There is more than a 67% chance that a person 
10 be who has tested positive does not have the disease. 
late 5 
35 a) 0 B) 
7% a) 0.035 b) 0048 <) a ©) Aand Bare not independent since P(A |B) # 
8. a) i red PIA). 
2 1 1 2 
a i) Gg ii) Z iii) = 
1 
z ES i b) i) no,since P(A B) 40 
3 blue ii) no, since P(A| B) # P(A) 
2 red ®) Z_— correct 
1 3 ill 
2 2 fore incorrect 
7 e 4 
3 blue 
7 be Fi hard =~ 3 correct 
D 2 incorrect ' 
25 2 f- incorrect 
b) i) 2 ii) 35 : 
ti correct 
Exercise 4.5 page 78 b 2 nr <i 
1. a) 0.28 b) 0.82 c) 0.12 2 _ 3 7 incorrect 
y 
4, a) 04 b) 0.5 c) 03 3 
59 ey 100 age: i im correct 
5 a) 1) oe) 6) cag ) ipa 3 a 
6. a) mutuallyexclusive b) mutually exclusive z incorrect 
c) neither d) mutually exclusive | 
and exhaustive b) = a2 
e) independent f) mutually exclusive 
7. a) true b) true 
c) true d) false 
tym Answers 


ew defectve Exercise 5.1 page 85 


1. a) nota discrete random variable since 


IP(X =x) #1 


x ae eee b) adiscrete random variable 
0.25, c) adiscrete random variable 
0.05 defective d) nota discrete random variable since P(X = 5) <0 
0.45 
y 2 a) [E [1 [2 [3 [4 [5 Je 
not 1 1 fafa fa 
Pe ove defective P(X=x) |6 16 | ele lele 
0.02 defective b) 
z y 2 ]4 [6 [8 Ji [12 
1 1 1 1 1 
P(Y=y) =l=lelezle = 
0.98 not 6 6 6 6 6 6 
defective ) 
¢, 
z 9 6 
b) i) 0.4275 ii) 0.9615 fifa] 6 | 25 | 3 
1 i 1 1 1 1 
©) Por0.445 (3 sf) PiZ=2) |elelels|s|s 
7 a) 035 
4 Ty of1 
b) P(FS) =0.35 # P(F) x P(S) = 0.6 x 0.4 = 0.24 
©) The golfer’s form from the first week is likely to pwew) | 2] + 
be carried into the second week. 
1 1 1 
a) z b) 3 c) z 3. 
0 
10. a) 0.05 b) 0.125 c) 08 
11. a) P(AMB)=P(A) x P(B), Pt 
P(A UB) = P(A) + P(B) — P(A) x P(B) 
2 1 
b) Zorg 
12. a) 02 b) 2 
13. i) 0.85 ii) 0.41 
14, i Exercise 5.2 page 87 
la 
15. i) 0.465 ii) 0.221 » TF [if2]3]4 
iii) 0.488 iv) 0.512 Probabili¢y, || 2: || 2.) 2 | 4 | = 
y 
16. i) 25-30 ii) 13 is | 15 | 15 | 15 | 15 
iii) 72 iv) $= 0.62 b) [| [1 5 
ans; 12 6 4 
Probability | zlslels 
Probability distributions 
5 ey. ? ©) nota probability function since it is undefined for 
and discrete random variables rel 
Skills check page 83 9 TF Ii [2 [3 [a 
1. 12,643 a 30 | 20 | as | 12 
pies Probability | = | > | = | = 
2. a=0.1,b=03 a ee a Me 


Answers 


2 a) [7 i (2 [3 [4 Summary exercise 5 page 94 
18 
i 4 3 2 1 1. ob) = c) 1.92 
Probability | 75 | io | io | to 25 
% Wh Ss ©) 242(3s£) 
» [5 12131415 3. a) 04 b) 6.56 ©) 0 
“lariat Ts 4, Xs nota discrete uniform distribution, Y and Z are 
Probability | = E s\lsls discrete uniform distributions. 
5. a) 03 b) 8 dl 
9 d) -1 2 4 
w 2[3]4[s[o]7[s[9fijufi2] 6 a » mean = 2, standard deviation = 1.14 (3 s.f.) 
ce|(|| 2 4a}5 s|4|a|2|4 ii) 5.1 minut 
Probability | 3¢ | 36 | 36 | 36 | 36 | 36 | 36 | 36 | 36 | 36 | 36 es See a 
b) ) 035 i) F 
4) [, [i [2][3]4 7. a) a+b=0.4,8a+ 11b=3.95 b) a=0.15, b= 0.25 
rt3alsl7 ©) 2.0475 
Probability | — | 7 | =| ie 
rT 16 | 16 | 16 i 
2 ee 1 [2 [3 [4 Js 
12 6 3 9 7 5 3 1 
4a) a5 is ob P(X=3) 25 | 25 | 25 | 25 | 25 
1 1 
M3 er b = ) 22 d) 1.36 
3 
4b) i) 052 i) = . 
ae B oa) Ty [1012] 15 | 16 [18] 20] 24] 25 
1 5 
5. c=a5 P(Y<3)=35 P(X=x) [0.1 .a| 05 | 00s [o2 0.1]0.2] 0.1 
Exercise 5.3 page 90 b) 17.85 ©) 26.7275 
1a) 3.2 b) -0.2 ©) 7.6875 10. b)  E(X) =3.33 (3.6) c) 0.689 (3 s.f.) 
2 aa) 2 2 MH. a) eb) E(X)=5, Var(X)=3 
3 a) i) 03 © ii) 27 iii) 9.3 
3 5 
b) i) + ii) 75 iii) 615 ©) 3g 4) mean = 3500, variance = 6750000 
4 5. a=0.4,b=0.2 esl) Score 2 3 4 5 6 
@ = 0.175, b = 0.125, P(X>E(X)) = 0.325 Probability | + | 2+ | 2 | 2) 2 
Exercise 5.4 page 93 ii) E(X) = 43 Var(x) = 2 
1. a) E(X)=7.1, Var(X) = 1.29 1) 2 
b) E(X) = 0.1, Var(X) = 1.29 18 
©) E(X) = 1.9375, Var(X) = 1.43 (3 s.£) scA) 
wl 
2. a) mean = 2.33 (3 s.f.), variance = 1.56 (3 s.f.) Number of green cards | 0 | 1 
b) mean = 3.5, variance = 2.92 (3 s.f.) Probability | al | a = 
©) mean = 7, variance = 5.83 (3 s.f.) : 
3. a) i) 03 ii) E(X)=2.9, Var(X) = 1.69 ii) 0.6 
») ) 2) EE=45,varey=0917 8st) MD) a= 025.b=0.15 i) 43875 
E(Y) = 2.53 (3 s.f.), Var(Y) = 1.97 (3 sf) 
a=0.1,b=0.5, Var(X) = 2.21 


.3, Var(X) = 18.01 


yom Answers 


6 Permutations and combinations Exercise 6.6 page 108 


i: 2 2 + 
Skills check page 97 Bs = " 
1. HH,TIHT,TH 3 a) 5 ) = 2 = 
2. HI, H2, H3, H4, H5, H6, T1, T2, T3, T4, T5, T6 4, 0.398 5. 0.0128 6. 1 


Exercise 6.1 page 99 Summary exercise 6 page 109 


1. a) 720 b) 6 ©) 479001600 i 
2 a) 7 b) 20 J 1. 369600 ee 
3. 24 4. 40320 5. 40320 3. 02 4 ; 
6. 120 7. 120 8. 3628800 
5. 3024000 
Exercise 6.2 page 100 6. i) 151200 ii) 64800 
1. a) 8 b) 210 ¢) 120d) 120 7. i) 726485760 ii) 3628800 iii) 0.995 
; 1 
2. a) 13366080 b) 2.18 x 10 & pie ini 38800 
3. 15120 4. 3024 5. 1814400 
6. 6720 7. 336 «8 360 ae DL 28 i) 20 aid) 79) 
Exercise 6.3 page 102 7 The binomial distribution 
1. 240 ‘ 
Skills check page 114 
2. a) 40320 b) 2880 
1. a) 3628800 b) 6435 
3. a) 241920 b) 40320 
2. 104 
4, 96 5. 24 6. 42 
7 a) 72 b) 24 Exercise 7.1 page 119 
8. a) 261273600 b) 3110400 lL al 7 2 35 35 2171 
9 144 b) i) 35 ii) 21 
Exercise 6.4 page 104 2. a) i) 210 ii) 1 iii) 5005 
1. a) 10080 b) 360 fv) 4950 
2 a) 120 b) 720 b) i) 45 ii) 462 iii) 1 
3. 1680 4 56 iv) 4.71 x 10" (3 s.f}) 
5 1960 6 8008 3. a) 0.234 (3 sf) b) 0.0938 (3s.£) 
7. a) 151200 b) 5040 4. a) 0.324(3s.f) b) 0.0102 (3s.f.) 
5. a) 0.142(3s.f.) b) 0.0420 (3 s.f.) 
Exercise 6.5 page 107 6. a) 0.00149 (3 s.f) b) 0.231 (3s.f) 
1. 27405 2. 15504 7. a) 0.0173 (3s.f.) b) 0.137 3s.f) 
3. b) Choosing r objects to assign to the ‘taken’ group 8. a) 0.0131 (3s.f.) b) 0.0393 (3 s.f) 
has a one-to-one correspondence with choosing 9. a) 0234(3s£) b) 0.0938 (3 sf) 


the n ~ r objects to assign to the ‘left behind’ group. 
. a) 14950 b) 8965 
5. 53130 


531 300; they need to ensure that neither the winner 
nor the runner up test positive for performance- 


10. a) i) 0.000977 (3s.f.) ii) 0.00977 (3 s.f.) 
iii) 0.0439 (3s.f) iv) 0.117 (3 s.f.) 
b) i) 0.0107(3s£) ii) (0.945 (3s.f.) 
iii) 0.828 (3 s.f.) 


enhancing drugs. 11. a) 0.264 (3s.f.) b) 1.00 (3s.f) 
12. a) 0.0913 (3 s.f) b) 9.09 x 109 (3 s.f.) 
13. a) 0.237 (3s.f.) b) 0.0879 (3 s.f.) 


Answers 


Exercise 7.2 page 121 


La) 175 b) 8.75 
2. mean = 30, standard deviation = 4.24 (3 s.f.) 
3. a) 2 b) 0.421 (3s.£) 
4, 100 
5. a) mean= 16, variance = 3.2 b) 0.589 (3 s.f.) 
6. Var(X) =20p(1 - p), p= 0.5 
7. 0.360 (3 s.f.) 
Exercise 7.3 page 126 
1. a) No, there are not a fixed number of trials. 
b) Yes, n=10, p=4 
c) No, since the balls are taken without replacement 
the trials are not independent. 
d) Yes, n=5,p=4 
e) Yes, n=25, p=t 
f) No, you are not counting the number of times a 
particular outcome is observed. 
2. a) i) 0.0424 (3s.f.) ii) 0.141 (3 s.f.) 
iii) 0.228 (3 s.f.) iv) 0.236 (3 s.f.) 
b) 3 c) mean = 3, variance = 2.7 
3. a) The cars are independent. n = 40, p = 0.08 
b) The screws are independent. n = 48, p = 0.02 
c) n=50,p=03 
d) There are a sufficient number of balls to assume 
that the colours of the balls are independent. 
n=50, p=03 
4. a) mean = 3, variance = 1.5 
b) mean = 3, variance = 3.07 (3 s.f.) 
c) No, the variance of 3.07 is more than double the 


variance given by the binomial model. 


Summary exercise 7 page 127 


1 


198 


2. 
3. 
4. 


a) 0.817 (3s.f.) b) 0408(3s.£) ) 13 

a) 125 b) 1.70 (3s.£) 

a) 0.825 (3 s.f.) b) 0.177 (3 s.f.) 

a) i) 0.683 (3s.f.) ii) 0.576 

b) 0.166 (3 s.f.) 

c) mean = 2, standard deviation = 1.10 (3 s.f.) 
Answers 


10. 


11. 
12. 


13. 


8 


d) i) 
ii) These values do not support Louise’ belief. 
‘The mean of 2.1 is close to the mean given 
by the binomial model but the standard 
deviation of 1.60 is approximately 45% 
greater than that given by the binomial 
model. 
a) i) 0.839 (3s.£) 
iii) 0.835 (3 s.f.) 


b) mean = 210, variance = 151.2 


mean = 2.1, standard deviation = 1.60 (3 s.f.) 


ii) 0.187 (3s.f) 


a) i) 0.219(3s£) ii) 0.950 (3s.f) 
iii) 0 iv) 0.686 (3 s.f.) 
b) i) mean =9, standard deviation = 1.90 (3 s.f.) 


ii) mean = 9.1, standard 
deviation = 1.58 (3 s.f) 


‘These values are close to those given by the 
binomial model and support Ronnie's claim. 

a) i) 0.839 (3s.f.) ii) 0.107 (3 s.f.) 

iii) 0.831 (3 s.f.) 

b) For the binomial model the mean is 8 and the 
variance is 6.4. These values do not support 
Conn’s claim because of the high value of his 
variance. 

a) 0.873 (3 s.f.) b) 0.0266 (3 s.f.) 


a) Two assumptions from: there are a fixed number 
of trials; each trial must have the same two 
possible outcomes; the outcomes of the trials are 
independent of each other; the probability of the 
outcomes remains constant. 


b) i) 0.0402 (3s.f.) ii) 0.314 (3 s.f.) 
c) Proof 

a) 0.411 (3 s.f) b) 0.210 (3 s.f.) 
c) The colours of the beads are independent. 
0.438 

i) 0.647 


ii) No — because weather on one day is often 
influenced by the weather on the day before. 


0.660 


The geometric distribution 


Skills check page 132 


1. 


2. 


a) P(B|A)=5 b) Pela’) =+ 


1 
P(2 sixes | at least 1 six) = aa 


Exercise 8.1 page 136 b) X~Geo(0.523); expected frequencies: 41.8, 20.0, 
1, a) 0.1875 b) 0.015625 9.5, 4.5, 2.2, 1.0, 0.5, 0.2 


2 a) 016 b) 0.36 c) Very close fit. 
Seva) C0 by onal Summary exercise 8 page 143 
4, a) 0.0154 b) 0.990 
1. a) 0.1024 b) 0.262 
5. a) 0.0864 RS 
b) P(X=6| X>2)=P(X=4) = 0.0864 sy of 3 > 8 
6. 0.316 27 27 
7. 0.218 4. a) 0. b) 0.216 
8. 0.0625 ia : 
©) P(X=6| X>3) = X=O) _ 06x04 _ gery o.g 
9. 0.272 WX>3) 06 
= 0.144 
Exercise 8.2 page 138 d) amemoryless property 
ate . 5. 0.613 
2 6 a) + b) 2% 
2. 10 r a 
3. 0.5904 7. 0.0625 
4; a) 2 i . 0,668 
= 
= 9. a) 0.1029 b) 0.2401 ~—-e)-_:0.832 
5 a b) 2 
a” ya 10. a) X~Geo( +) b) 1) 00988 ii) > 
a) 0.1024 b) 0.2304 63 H w008e 
7 a) 20 b) 0.226 c) 0215 d) 1 Lb) P(Y, + ¥,>3)=1-P(Y, + ¥,=2,3) = 
4) N~Ge0(2) 2 1 = (0.8 +2 x 0.3" x 0.7) = 0.784 
‘ 36 5 
13. a) — 
©) 0.598 4) 0.402 36 
9. a) X~Geo(0.5122) —b) 1.95 5} OBO Gs) 
©) 0.5122 d) 0.4878 c) 0.402 (3 sf.) 
Exercise 8.3 page 142 a) 0147 
5 
La) Yes, p=2 b) No- binomial een 


c) No - probability changes 
a) Yesp=2 2 Yenp=2 9 The normal distribution 


£) No -this isa linear combination of random Skills check page 146 
variables. 1. w=41,0=10 
2. Mode, mean, median: 205 


5 
— b) 1,25, 5, : 
a) a ye Ry ae Exercise 9.1 page 150 


1. a) i) 2 i) -05 if) 15 iv) OO 
b) i) 65.1 ii) 39.2 iii) 532 iv) 70 


3. Independence and the probability remains constant 
for all. 


a) Questionable that the probability remains 


constant over the whole day — p = 0.08 if the 2. a) i) -14ii) -5.6 iii) 0.86 iv) -0.06 
model was thought adequate. b) i) 98.5 ii) 76.5 iii) 84 iv) 92 
b) p=0.02 ©) p=02 d) p=02 3. a) i) 1 ii) -05 iii) 25 
4. a) 191(3s£) iv) 0.233 (3dp) 


Answers 


b) i) 114i) -126 iii) 0.6 — iv) 24.6 4, a) =294,0=8.29 
a) 6 b) 59 b) Travel times are likely to have a positive skew - 
75 caused by roadworks, train delays etc. If this is not 
i018 severe then the normal is likely to be a reasonable 
. model (it is continuous and 0 is >3.5 standard 
i. deviations below mean, so truncation of negative 
Exercise 9.2 page 155 values is not an issue). 
1, a) 0.4761 b) 0.9957 <) 0.7881 a) 0.7734 b) 481hours c) 49.6hours 
d) 0.1089 e) 0.7721 a) 0.1056 b) 0.864 ©) 343ml 
2. a) 0.8599 b) 0.6179 <) 0.2358 d) 333ml ©) 608ml 
d) 0.0905 a) 0.2266 b) 0.253 
3. a) 0.2061 b) 0.7675 <) 0.8371 a) i) 11.51% ii) 86.21% iii) 483 seconds 
a) 0.9282 b) 0.4716 ) 0.0205 b) 351 seconds 
d) 0.9638 9. a) 662ml b) 0.0002% 
5. a) 1.282 b) 2.576 <) -1.960 10. a) 2.28% b) 29 ©) 17.95mm 
d) -0.842 e) 1.555 f) 2.120 d) 0.0063 
Exercise 9.3 page 159 . 
Pag Summary exercise 9 page 168 
a) 0.9641 b) 0.2119 c) 0.1587 
1, a) 0.0228 b) 0.2384 <) 874 
2. a) 0.3085 b) 0.9599 ) 0.3354 
2. a) 0.0196 b) 0.0531 <) 0.0010 
3. a) 0.9772 b) 0.3696 <) 0.3598 
d)  Itis unlikely that the performances in the two 
4 a) 05 ib): 1914279) <)', 16826 events are independent. 
Sia) WOe7a: AB) “DEBS. “ep! “D161: 3. a) 0.1587 —b):0.1624. =e): 3.88 ml 
Sa) Bis Ib): 26.0 4. a) Any two of the following: 
7 a) 937 b) 80.8 ) 86.7 x -Rimevaamete 
8 a) 497 b) 3.26 + itis infinite in both directions 
2 
% 2 1% 271 « it has a single peak at the centre 
11. 5.22 12. 4.98 . it is continuous 
15. 0546 + 95% of values lie within approximately 2 
|) = 31.9, 0= 5.86 standard deviations of the mean 
15. [= 73.3, = 18.7 * 99% of values lie within approximately 3 
16. = 1.54, 0=2.21 standard deviations of the mean. 
17. = 1074, = 69.4 b) =291 hours, o=29.7 hours c) $64600 
18. = 47.3, 0= 5.58 a) 0.1186 b) 0.7435 <) 265 
a) 9.45 minutes 
Exercise 9.4 page 165 b) Whether this is reasonable may depend a bit 
1. 124.7 on what the subject is: it probably would not be 
2 a) 0.2798 b) 0.4982 reasonable in Mathematics where fully correct 
answers will take very little time to mark, where 
3 ia) Proof b)! 45.92% =e). 0.1582 interpretative subjects like English or History 
d) Production processes are usually broadly might be modelled well by a normal (again, it is 
symmetric, and diameters are a continuous continuous and the negative truncation is not a 
measure. Diameters can’t be negative, but 0 problem). 
is > 100 standard deviations below the mean for 7 a) 0.0228 b) 82 ) 0.0150 
this context, so this is not an issue. The normal is J . : 
likely to be a good model for this context. 
200 FASS 


d) Itis unlikely that the performance in the written Exercise 10.2 page 175 
paper and the performance in the project are 


1, a) P(Y<415) b) P(Y>31.5) ©) P(¥>8.5) 


independent of one another. a) P(42.5<¥<855) 
a) i) Proof ii) 85 b) 0.2684 2. 
a) i) 0.2266 ii) 0.6678 iii) 0.0287 x 0 1 2 3 4 5 6 
b) profit = $34.75 probability | 0.015625] 0.09375 | 0.234375 | 0.3125 | 0.234375 | 0.09375 | 0.015625 
10. a) i) 0.0913 ii) 0.186 b) 139 a 
11. i) 0.266 ii) 0.268 iii) 647ml 0.36 
12. 9.2cm 0.3 
13. = 323.7, 0= 8.32 Paid 
5 02 
Bo.15 
& 
10 The normal approximation on 
5 5 outa a 0.05 
to the binomial distribution b 
Skills check page 172 © ‘numberof successes 
1. mean = 2.4, variance = 1.68 
2. a) 0.9772 b) 0.3696 ©) 0.3598 x 0 a a | 4 5 6 
Prob | 0.531441 0.354294] 0.098415 0.01458 [0.001215 5.4E-05 | 0.000001 
Exercise 10.1 page 174 a=bia=l05 
0.6 
2 
2 04 
2 03 
8 0. 
n=6,p=0.5 a 
0.35 2 
03 0.1 
0 
026 0123456 
. Number of successes 
80.15 ‘ 
© oa Exercise 10.3 page 177 
0.05 1. a) N(35, 10.5) ¢) N(100, 80) 
0 2. a) 0.9775 b) 0.1136 ) 0.4887 
0123456 
Number of successes d) 0.8263 e) 0.6288 
3. a) i) 0.6923 ii) 0.6975 
2. b) i) 0.0052 ii) 0.75% 
a : a lis | | s | 4, np>Sand n(1-p)>5, N(np, np(1 - p)) 
Prob | 0.531441] 0.354294 o.ose41s [0.01458 0.001215 | 5.4E-05 | 0.000001 
5. a) 0.2342 b) 0.0268 
ie n-6,p-0.5 6. a) B(50,0.25) b) 0.0111 ©) 0.8023 
‘8 7. a) 0.8689 
2 4 b) They are unlikely to be a random sample since 
3 they are patients who are more likely to be ill and 
S 03 F 
& to have raised blood pressure. 
0.2 
0.1 Summary exercise 10 page 178 
to 1. a) 3.2 b) 0.7939 ©) 0.0066 
012345 6 
Number of successes 2. a) 0.0547 b) 0 °. 0 


Answers 


3. a) 0.1252 b) 0.005 <) 0 6 a) i) 117 ii) 71 iii) 270 b) 30 


a) i) 0.850 ii) 0.350 7. a) 0.315 
b) 0.9502 ©) 0.425 b) 
a) 0.805 b) 0.0277 XX 1]2]3]4]5]6]8]9{10]12[15]16|20[25 
a) i) 0.0718 ii) 0.729 iii) 0.2364 Prob(/36)| 1} 2}2|3|2 2[2 9 |i] 23) 2 | | 
b) 0.001 
) ) 625;4497 d) S 
7. i) 0.558 ii) 0.013 
8. 0.065 Exam-style paper B page 184 
9. 0.189 Ds Peek 
2. a) median 78, LQ =70, UQ=84 
10. i) 0.0272 ii) 0.718 iii) 0.113 b) \ i 
Before exercise 


Exam-style paper A page 182 
1. x= 30.024; o= 0.461 
2. 


= 


After exercise 


5 
24 
2 50 60 70 80 90 100 110 
S38 
> (3 x 5) + (5 x 15) + 30f + (4 x 55) + (3 x 95) 
Bo 3. a) 35.8= eT 
Bi =f=10 
b) 263 
| 40 20 30 40 * 4. a) 0.544 b) 0.0032 
Distance run (km) 5. a) 0.04 b) 0.64 c) 0.455 
1 
3. a) o=0.950 b) 0.4164 6. a) Proof  b) s ©) 409 d) 0.295 
4. a) 0.159 b) 0.579 7 a) i) 3. ii) 28 iii) 243° b) 302400 
12 15 24 
2 5 9 F 
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Data sets 


Glossary 


Command words 


calculate Work out from given facts, figures 
and information. 


describe Give the characteristics and main 
features. 
determine Establish with certainty. 


evaluate Judge or calculate the quality/ 
importance/amount/value of something. 

explain Set out purposes/reasons/mechanisms, 
or make the relationship between things clear, 
with supporting evidence. 

identify Name/select/recognise. 

justify Support a case with evidence/argument. 


show that Provide structured evidence that 
leads to a given result. 


sketch Make a simple freehand drawing 
showing the key features. 


state Express in clear terms. 


verify Confirm that a given statement/result 
is true. 


Mathematical words 


Bernoulli trials A series of trials where each 
has two possible outcomes of interest, the trials 
are independent and the probabilities of the 
outcomes stay constant through the series. 

bias Statistical bias occurs when the sampling 
method used means that some parts of a 
population are likely to be under- or over- 
represented. 

binomial distribution A distribution used to 
model how often one outcome occurs in a series 
of Bernoulli trials. 


box-and-whisker plot A graphical 
representation of a distribution using five points 
— the median, upper and lower quartiles and the 
maximum and minimum values. 

categorical data Data that can be divided into 
distinct categories. They can be numerical or 
non-numerical. 


census A census gathers information about 
every member of a population. 


class interval In categorical data these are the 
groups used, e.g. ages 15-20. 


code Transform a set of data by a linear function, 
usually to make numerical work simpler. 


combinations The number of ways of 
choosing some items from a larger group of 
items if the order of selection does not matter. 


complementary events The complementary 
event of event A is that A won't happen, written 
as A’. 

compound event The outcome of more than 
one simple event considered as a single outcome, 
for example the total score when two fair dice are 
thrown. 


conditional probability This is the probability 
that an event B happens in the knowledge that 
event A has already happened. 


continuity correction A correction required 
when a continuous distribution (usually the 
normal) is used to approximate the binomial (or 
any other distribution that takes only discrete 
values). 


continuous probability distribution | Where 
the possible outcomes in a situation are 
continuous (for example heights or weights) then 
this describes the probability that the outcome 
lies in a particular range of values. 


continuous data A quantity such as weight 
that, unlike discrete data, is not restricted to 
defined values. In most situations the accuracy 
of the measurement determines how it is 
reported. 

data logging Collecting data automatically 
using some form of technology, e.g. a traffic 
counter. 

discrete data Data that can only take on 
defined values — e.g. a count of objects can 
only take values ‘0, ‘1’ , ‘2; ‘3’ etc. and not, for 
example, ‘3.73. 


Glossary 


empty set A set containing no elements. If B 
and Care sets with no common elements then 
the set of elements which belong to both B and C 
is empty. We write BNC=@ or BNC={}. Also 
called a null set. 

estimate When data is summarised in intervals, 
details of exact values are not known and we can 
only work out approximations (estimates) to 
summary statistics like the mean, median, etc. 


event A set of possible outcomes from an 
experiment. 
exhaustive A set of events is exhaustive if 


all possible outcomes occur in at least one of 

the sets. 

expected value (of a probability distribution) 
This is found by multiplying each value by its 
probability and then summing over all possible 
values of the random variable. This is written as 
u = E(X) and is sometimes called the mean of the 
probability distribution. 

experimental probability The number of 
times a particular outcome occurs expressed as a 
fraction (or decimal) of the number of tests. Also 
known as relative frequency. 

false positive A positive result in a screening 
test when the patient does not have the disease. 
frequency density The average rate at which 
observations occur in an interval (= frequency / 
interval width). 
frequency table 
list of values. 


A way of summarising a large 


Gaussian distribution Another name for the 
normal distribution, generally used in Physics. 
geometric distribution A distribution that 
represents the number of failures in a series of 
Bernoulli tests before you get a success. 
geometric series A series of numbers with a 
constant ratio between each of the terms. 
independent Two events that do not affect 
the outcome of each other are independent. 
This means that P(A|B)= P(A), i-e. knowing 
that B has happened does not give any further 
information as to whether or not A happens. 


Glossary 


interpolation Working out an estimate of 

a value within a group using proportional 
reasoning. 

interquartile range (IQR) The spread of the 
middle 50% of values (from 25% to 75%). 
intersection This is when two events A and B 
have to both happen, written as AN B. 


lower quartile The smallest of the three values 
which divide a data set (where the values are 
arranged in order) into four equal parts. 


mean _ The sum ofall the values divided by the 
number of values. 


median The middle value when the values are 
arranged in order. 

memorylessness ‘The property of a geometric 
distribution where the waiting time for an event 
to occur does not depend on how much time (or 
how many trials) have already occurred. 


mode The most commonly occurring value. 


mutually exclusive Two events A and Bare 
mutually exclusive if they cannot occur at the 
same time. 


negatively skewed A distribution skewed 
towards the upper end of the range. 


normal distribution A continuous 
distribution that is symmetric and infinite in 
both directions; 95% of values lie within two 
standard deviations of the mean, and 99% lie 
within three standard deviations of the mean. 


normal population A set of data that has (or is 
expected to have) a normal distribution. 


normal probability table A table of values 
for the standard normal distribution which can 
be used to deduce probabilities for any normal 
population. 


nullset See empty set. 


parameter Some property of a family of 
probability distributions. The value(s) of the 
parameter(s) defines the specific distribution to be 
considered. For example, the binomial distribution 
has two parameters (the number of trials, n, and 
the probability of a success on a trial, p). 


partition A group of sets that are exhaustive 
and mutually exclusive form a partition. 
Pascal’s triangle _A visual representation of 
the binomial coefficients. In general the nth row 
n 
consists of the value of("] for r=0,1,2,...,". 
permutations The number of ways a group of 
objects can be arranged. 
population The larger group from which a 
sample is drawn. 
positively skewed Describes a distribution 
skewed towards the lower end of the range. 
possibility space diagram _A table showing 
the possible outcomes when two things happen 
according to some rule. Also known as a sample 
space diagram. 
primary data Any data you have collected 
yourself. 
probability A measure of how likely 
something is to happen. 
probability distribution Describes all the 
possible outcomes in a situation along with the 
probability of each outcome, either in a list or by 
a formula. 
probability experiment An experiment that 
has unpredictable outcomes. 
probability function A formula describing the 
probability that a discrete random variable takes 
particular values. 


proxy A variable that is not in itself relevant, 
but stands in for something that is not directly 
measurable or is difficult to measure. 
qualitative |©Non-numerical data. 
quantitative Numerical data. 

quartile One of the three values which divide a 
data set (where the values are arranged in order) 
into four equal parts. 

random variable A quantity that can take any 
value determined by the outcome of a random 
event. 


range ‘The spread of values. 


relative frequency The number of times 

a particular outcome occurs expressed as a 
fraction (or decimal) of the number of tests. Also 
known as experimental probability. 

sample space _A list (in curly brackets { }) 
of the outcomes of a compound event. Each 
outcome is in its own pair of brackets ( ). For 
example, the sample space for tossing a coin 
twice is {(H,H); (H,T); (T,H); (T,T)}. 
sampling ‘The selection of a subset of the 
whole population being studied. 


secondary data Data collected by someone 
other than the person using the data. 


skewness The lack of symmetry in a distribution. 


spread ‘The amount of variation within a set of 
data. 

standard deviation ‘The square root of the 
variance. 

standard normal distribution A normal 
distribution with mean 0 and variance 1. 


standardised score One way to compare 
different populations is to standardise the scores 
by looking at their distance from the mean, then 
dividing by the size of the standard deviation. 
statistical experiment _ A statistical study in 
which the researchers make an intervention and 
measure the effect of this intervention on some 
outcome of interest. 

stem-and-leaf diagram Displays a data list 
visually to show the distribution, but keeps 

the detailed information; the stem defines the 
groups, and the leaf then shows the detailed 
values in order. 

symmetric Describes a distribution with 
equal-length tails, distributed evenly around the 
mean. 

tail probabilities The probability of an event 
occurring in the tails (e.g. upper and/or lower 
5%) of the distribution. 

theoretical probability The calculated probability 
ofa particular outcome. For example, with a fair 
coin toss the probability of ‘heads’ is 50%. 
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tree diagram A visual representation 

of a sequence of events, used to calculate 
probabilities. 

trial In experiments, you often do the same thing 
repeatedly. Each one is often referred to asa trial. 
union An outcome where either event A or 
event B (or both) can occur is described as a 
union, written AUB. 

upper quartile The largest of the three values 
which divide a data set (where the values are 
arranged in order) into four equal parts. 
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value (probability) The value assigned to the 
outcome of an event, so for the compound event 
(1, 5) from throwing two dice the value could be 
the sum of these, 6. 

variance The average of the squared distances 
from the mean. 


z-score A value that describes how many 
standard deviations a data point is from the 
mean. 
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